WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



r-n 
oo 
oo 

r~- 



(51) International Patent Classification 6 : 

C12N 15/86, 15/40, 7/OJ, 15/79, 5/10, 
A61K 48/00 



A2 



(II) International Publication Number: WO 99/18226 

(43) International Publication Date: 15 April 1999 (15.04.99) 



(21) International Application Number: PCT/US98/2 1 062 

(22) International Filing Date: 6 October 1998 (06.10.98) 



(30) Priority Data: 

08/944,465 



6 October 1997 (06.10.97) 



US 



(71) Applicants: CHIRON CORPORATION [US/US]; 4560 Horton 

Street, Emeryville, CA 94608-2916 (US). WASHINGTON 
UNIVERSITY [US/US]; One Brookings Drive, St. Louis, 
MO 63130 (US). 

(72) Inventors: DUBENSKY, Thomas, W., Jr.; P.O. Box 802, Del 

Mar, CA 92014 (US). POLO, John, M.; 221 Witham Road, 
Encinitas, CA 92024 (US). BELLI, Barbara, A.; 5850 De- 
spejo Place, San Diego, CA 92124 (US). SCHLESINGER, 
Sondnt; 6320 McPherson Street, St. Louis, MO 63 1 30 (US). 
DRYGA, Sergey, A.; 1307 Casa Grande Boulevard, Fort 
Collins, CO 80525 (US). FROLOV, llya; 200 Tanglewood, 
Sr. Louis, MO 63124 (US). 

(74) Agents: MC MASTERS, David, D. et al.; Seed and Berry 
LLP, 6300 Columbia Center, 701 Fifth Avenue, Seattle, WA 
)8 104-7092 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE). 



Published 

Without international search report arid to be republished 
upon receipt of that report. 



(54) Title: RECOMBINANT ALPHA VIRUS-BASED VECTORS WITH REDUCED INHIBITION OF CELLULAR MACROMOLF.C- 
ULAR SYNTHESIS 



(57) Abstract 



Isolated nucleic acid molecules are disclosed, comprising an alphavirus nonstinctural protein gene which, when operably incorporated 
into a recombinant alphavirus particle, eukaryotic layered vector initiation system, or RNA vector rep] icon, has a reduced level of 
vector-specific RNA synthesis, as compared to wild-type, and the same or greater level of proteins encoded by RNA transcribed from 
the viral junction region promoter, as compared to a wild-type recombinant alphavirus particle. Also disclosed are RNA vector replicons, 
alphavirus vector constructs, and eukaryotic layered vector initiation systems which contain the above-identified nucleic acid molecules. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT, 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


CA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


\'L 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


HA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Motdova 


TG 


Togo 


IS If 


Barbados 


Gli 


Ghana 


mc; 


Madagascar 


TJ 


Tajikistan 


UK 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


nr 


Rnrkiua Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


fit 


Hungary 


ML 


Mali 


'IT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


UK 


IJiazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


HV 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mex ico 


vz 


Uzbekistan 


CF 


Centra! African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


cc 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


Cil 


Swiiierland 


KG 


Kyrgyzsian 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cflrc d'l voire 


KP 


Democratic People's 


NZ 


New Zealand 






Cm 


Cameroon 




Republic of Korea 


PL 


Poland 






CIS- 


Cliina 


KR 


Republic, of Korea 


PT 


Portugal 






C' u 


Cuba 


KZ 


Ka?.okstan 


KO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






f)K 


Germany 


U 


Liechtenstein 


SI) 


Sudan 






OK 


Denmark 


LK 


Sri Lanka 


SH 


Sweden 






EH 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/18226 



PCT/US98/21062 



t 

Description 

RECOMBINANT ALPHA VIRUS-BASED VECTORS WITH REDUCED 
INHIBITION OF CELLULAR MACROMOLECULAR SYNTHESIS 

5 

Cross-Reference to Related Application 

This application is a continuation-in-part of copending U.S. Patent 
Application No. 833.148. filed April 4. 1997; which is a continuation-in-part of U.S. 
continuation-in-part of U.S. Patent Application No. 08/679,640. filed July 12, 1996; 
10 which is a continuation-in-part of U.S. Patent Application No. 08/668.953 filed June 24, 
1996. which is a continuation-in-part of U.S. Patent Application No. 08/628,594. filed 
April 5. 1996, all of which are incorporated herein in their entirety. 

Statement of Government Interest 
15 This invention has been made in part with government support under 

grant number AI 1 1 377. awarded by the National Institutes of Health. The government 
may have certain rights in the invention. 

Technical Field of the Invention 
20 The present invention relates generally to recombinant DNA technology; 

and more specifically, to the development of recombinant vectors useful for directing 
the expression of one or more heterologous gene products. 

Background of the Invention 
25 Alphaviruses comprise a set of genetically, structurally, and serologically 

related arthropod-borne viruses of the Togaviridae family. These viruses are distributed 
worldwide, and persist in nature through a mosquito to vertebrate cycle. Birds, rodents, 
horses, primates, and humans are among the defined alphavirus vertebrate 
reservoir/hosts. 
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Twentv-six known viruses and virus subtypes have been classified 
within the alphavirus genus utilizing the hemagglutination inhibition (HI) assay. This 
assay segregates the 26 alphaviruses into three major complexes: the Venezuelan 
equine encephalitis (VEE) complex, the Semliki Forest (SF) complex, and the western 
5 equine encephalitis (WEE) complex. In addition, four other viruses, eastern equine 
encephalitis (EEE). Barmah Forest, Middelburg, and Ndumu, receive individual 
classification based on the HI serological assay. 

Members of the alphavirus genus also are classified based on their 
relative clinical features in humans: alphaviruses associated primarily with 

10 encephalitis, and alphaviruses associated primarily with fever, rash, and polyarthritis. 
Included in the former group are the VEE and WEE complexes, and EEE. In general, 
infection with this group can result in permanent sequelae, including behavior changes 
and learning disabilities, or death. In the latter group is the SF complex, comprised of 
the individual alphaviruses Semliki Forest, Sindbis, Ross River, Chikungunya, 

15 O'nyone-nyong, and Mayaro. With respect to this group, although serious epidemics 
have been reported, infection is in general self-limiting, without permanent sequelae. 

Sindbis virus is the prototype member of the Alphavirus genus of the 
Togaviridae family. Its replication strategy after infection of cells {see Figure 1) has 
been well characterized in chicken embryo fibroblasts (CEF) and baby hamster kidney 

20 (BHK) cells, where Sindbis virus grows rapidly and to high titer, and serves as a model 
for other alphaviruses. Briefly, the genome from Sindbis virus (like other alphaviruses) 
is an approximately 12 kb single-stranded positive-sense RNA molecule which is 
capped and polyadenylated, and contained within a virus-encoded capsid protein shell. 
The nucleocapsid is further surrounded by a host-derived lipid envelope into which two 

25 viral-specific glycoproteins, El and E2, are inserted and anchored to the nucleocapsid. 
Certain alphaviruses {e.g., SF) also maintain an additional protein, E3, which is a 
cleavage product of the E2 precursor protein, PE2. After virus particle absorption to 
target cells, penetration, and uncoating of the nucleocapsid to release viral genomic 
RNA into the cytoplasm, the replicative process is initiated by translation of the 

30 nonstructural proteins (nsPs) from the 5' two-thirds of the viral genome. The four nsPs 
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(nsPl-nsP4) are translated directly from the genomic RNA template as one of two 
polyprotems (nsP123 or nsPI234), and processed post-translationally into monomelic 
units by an active protease in the C-terminal domain nsP2. A leaky opal (UGA) codon 
present between nsP3 and nsP4 of most alphaviruses accounts for a 10 to 20% 
5 abundance of the nsP1234 polyprotein. as compared to the nsP123 poiyprotein. Both of 
the nonstructural polyproteins and their derived monomeric units may participate in the 
RNA replicative process, which involves binding to the conserved nucleotide sequence 
elements (CSEs) present at the 5' and 3' ends, and a junction region subgenomic 
promoter located internally in the genome (discussed further below). 

10 The positive strand genomic RNA serves as template for the nsP- 

catalyzed svnthesis of a full-length complementary negative strand. Synthesis of the 
complementary negative strand is catalyzed after binding of the nsP complex to the 3' 
terminal CSE of the positive strand genomic RNA. The negative strand, in turn, serves 
as template for the synthesis of additional positive strand genomic RNA and an 

15 abundantly expressed 26S subgenomic RNA, initiated internally at the junction region 
promoter. Synthesis of additional positive strand genomic RNA occurs after binding of 
the nsP complex to the 3' terminal CSE of the complementary negative strand genomic 
RNA template. Synthesis of the subgenomic mRNA from the negative strand genomic 
RNA template, is initiated from the junction region promoter. Thus, the 5' end and 

20 junction region CSEs of the positive strand genomic RNA are functional only after they 
are transcribed into the negative strand genomic RNA complement (i.e., the 5' end CSE 
is functional when it is the 3' end of the genomic negative stranded complement). The 
structural proteins (sPs) are translated from the subgenomic 26S RNA, which represents 
the 3' one-third of the genome, and like the nsPs, are processed post-translationally into 

25 the individual proteins. 

Several groups have suggested utilizing certain members of the 
alphavirus genus as an expression vector, including, for example, Sindbis virus (Xiong 
et al.. Science 245:1188-1191, 1989; Hahn et ah, Proc. Nail. Acad. Sci. USA 89:2679- 
2683, 1992; Dubensky et ah, J. Virol. 70:508-519, 1996), Semliki Forest virus 

30 (Liljestrom, Bio/Technology 9:1356-1361, 1991), and Venezuelan Equine Encephalitis 
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virus (Davis et aL J. Cell Biochem. SuppL 19A:\0 y 1995). In addition, one group has 
suggested using alphavirus-derived vectors for the delivery of therapeutic genes in vivo. 
One difficulty, however, with the above-referenced vectors is that inhibition of host 
celt-directed macromolecular synthesis {i.e., protein or RNA synthesis) begins within a 
5 few hours after infection and cytopathic effects (CPE) occur within 12 to 16 hours post 
infection (hpi). Inhibition and shutoff of host cell protein synthesis begins within 2 hpi 
in BHK cells infected with recombinant viral particles, in the presence or absence of 
structural protein expression, suggesting that the early events after virus infection (e.g., 
synthesis of nsPs and minus strand RNA) may directly influence the inhibition of host 

10 cell protein synthesis and subsequent development of CPE and cell death. 

SIN-1 is a variant strain derived from wild-type Sindbis, and was 
isolated from a culture of BHK cells persistently infected with Sindbis virus over a 
period of one month (Weiss et al. J. Virol. 33: 463-474, 1980). A pure SIN-1 virus 
stock obtained by expansion from a singie plaque does not kill the BHK cells which it 

15 infects. Importantly, virus yields (>10 3 PFU/cell) are the same in BHK cells infected 
with wild-type Sindbis virus or the variant SIN-1 virus. Thus, the principle phenotype 
of SIN-1 in infected BHK cells is characterized by production of wild-type levels of 
infectious virus in the absence of virus-induced cell death. 

The present invention provides recombinant vectors with selected 

20 desirable phenotypes for use in a variety of applications, including for example, gene 
therapy and recombinant protein production, and further provides other related 
advantages. 

Summary of the Invention 

25 Briefly stated, the present invention provides RNA vector replicons, 

alphavirus vector constructs, eukaryotic layered vector initiation systems and 
recombinant alphavirus particles which exhibit reduced, delayed, or no inhibition of 
cellular macromolecular synthesis (e.g., protein or RNA synthesis), thereby permitting 
the use of these vectors for protein expression, gene therapy and the like, with reduced, 

30 delayed, or no development of CPE or cell death. Such vectors may be constructed 
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from a wide variety of alphaviruses (e.g., Semliki Forest virus. Ross River virus, 
Venezuelan equine encephalitis virus or Sindbis virus), and designed to express 
numerous heterologous sequences (e.g., a sequence corresponding to protein, a 
sequence corresponding to antisense RNA. a sequence corresponding to non-coding 
5 sense RNA. or a sequence corresponding to ribozyme). 

Within one aspect of the invention, isolated nucleic acid molecules are 
provided comprising an altered alphavirus nonstructural protein gene which, when 
operablv incorporated into a recombinant alphavirus, increases the time required to 
reach 50% inhibition of host-cell directed macromolecular synthesis following 

10 expression in mammalian cells, as compared to a wild-type alphavirus. As utilized 
within the context of the present invention, "altered alphavirus nonstructural protein 
gene" refers to a gene which, when operablv incorporated into an alphavirus RNA 
vector replicon, recombinant alphavirus panicle, or eukaryotic layered vector initiation 
svstem, produces the desired phenotype (e.g., reduced, delayed or no inhibition of 

15 cellular macromolecular synthesis). Such altered alphavirus nonstructural protein genes 
will have one or more nucleotide substitutions, deletions, or insertions, which alter the 
nucleotide sequence from that of the wild-type alphavirus gene. The gene may be 
derived either artificially (e.g., from directed selection procedures; see Example 2 
below), or from naturally occurring viral variants (see Example 1 below), in addition, it 

20 should be understood that when the isolated nucleic acid molecules of the present 
invention are incorporated into an alphavirus RNA vector replicon, recombinant 
alphavirus particle, or eukaryotic layered vector initiation system as discussed above, 
that they may, within certain embodiments, substantially increase the time required to 
reach 50% inhibition of host-ceil directed macromolecular synthesis, up to and 

25 including substantially no detectable inhibition of host-cell directed macromolecular 
synthesis (over any penod of time). Assays suitable for detecting percent inhibition of 
host-cell directed macromolecular synthesis include, for example, that described within 
Example 1. 

Within other aspects of the invention, isolated nucleic acid molecules are 
30 provided comprising an altered alphavirus nonstructural protein gene which, when- 
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operably incorporated into a recombinant aiphavirus particle, eukaryotic layered vector 
initiation system, or RNA vector replicon. results in a reduced levei (e.g., 2-fold, 5-fold, 
10-foid, 50-fold or more than 100-fold) of vector-specific RNA synthesis as compared 
to the wild-type, and the same or greater level of protein encoded by RNA transcribed 
5 from the viral junction region promoter, as compared to a wild-type recombinant 
aiphavirus particle, wild-type eukaryotic layered vector initiation system, or wild-type 
RNA vector replicon. Representative assays for quantitating RNA levels include [ J H] 
uridine incorporation as described in Example 1. or RNA accumulation as detected by 
Northern Blot analysis (see Example 4). Representative assays for quantitating protein 

10 levels include scanning densitometry (see Example 4) and various enzymatic assays 
(see Examples 3-5). 

Within one embodiment of the above, the isolated nucleic acid molecule 
encodes nonstructural protein 2 (nsP2). Within a further embodiment, the isolated 
nucleic acid molecule has a mutation in the LXPGG motiff of nsP2. 

15 Within another aspect of the invention, expression vectors are provided 

comprising a promoter operably linked to one of the above-described nucleic acid 
molecules. Within one embodiment, the expression vector further comprises a 
polyadenylation sequence or transcription termination sequence 3' to the nucleic acid 
molecule. 

20 Within yet another aspect of the present invention, aiphavirus vector 

constructs are provided, comprising a 5' promoter which initiates synthesis of viral 
RNA in vitro from cDNA. a 5' sequence which initiates transcription of aiphavirus 
RNA, a nucleic acid molecule which operably encodes all four alphaviral nonstructural 
proteins including an isolated nucleic acid molecule as described above, an alphavirfus 

25 viral junction region promoter, an aiphavirus RNA polymerase recognition sequence 
and a 3' polyadenyiate tract. 

Within a related aspect, such constructs further comprise a selected 
heterologous sequence downstream of and operably linked to a viral junction region. 
Within a related aspect, aiphavirus vector constructs are provided comprising a 5* 

30 promoter which initiates synthesis of viral RNA in vitro from cDNA, a 5* sequence 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



which initiates transcription of alphavirus RNA, a nucleic acid molecule which operably 
encodes all four alphavirus non-structural proteins, an alphavirus viral junction region 
promoter, an alphavirus RNA polymerase recognition sequence, and a 3' polyadenylate 
tract, wherein said in vitro synthesized RNA ; upon packaging into an alphavirus panicle 
5 and introduction of the panicle into a mammalian host cell, increases the time required 
to reach 50% inhibition of host-cell directed macromolecular synthesis following 
expression in mammalian cells, as compared to a wild-type alphavirus panicle. 

Within a further aspect, alphavirus vector constructs are provided 
comprising a 5 ? promoter which initiates synthesis of viral RNA in vitro from cDNA, a 

10 5 ! sequence which initiates transcription of alphavirus RNA, a nucleic acid molecule 
which operably encodes all four alphavirus non-structural proteins, an alphavirus viral 
junction region promoter, an alphavirus RNA polymerase recognition sequence, and a 
3* polyadenylate tract, wherein said in vitro synthesized RNA, upon packaging into an 
alphavirus panicle and introduction of the panicle into a mammalian host cell, has a 

15 reduced level of vector-specific RNA synthesis as compared to wild-type alphavirus 
panicle, and the same or greater level of protein encoded by RNA transcribed from the 
viral junction region promoter, as compared to a wild-type alphavirus panicle. 

Within yet other aspects of the present invention, RNA vector replicons 
capable of translation in a eukaryotic system are provided, comprising a 5' sequence 

20 which initiates transcription of alphavirus RNA. a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins, including the isolated nucleic acid 
molecules discussed above, an alphavirus viral junction region, an alphavirus RNA 
polymerase recognition sequence and a 3' polyadenylate tract. 

Within a related aspect, alphavirus RNA vector replicons capable of 

25 translation in a eukaryotic system are provided, comprising a 5' sequence which 
initiates transcription of alphavirus RNA, a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins, an alphavirus viral junction region 
promoter, an alphavirus polymerase recognition sequence and a 3' polyadenylate tract, 
wherein said alphavirus RNA, upon packaging into an alphavirus panicle and 

30 introduction of the panicle into a mammalian host cell, increases the time required to 
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reach 50% inhibition of host-cell directed macromolecular synthesis following 
expression in mammalian cells, as compared to a wild-type alphavirus particle. 

Within oiher aspects, alphavirus RNA vector replicons capable of 
translation in a eukaryotic system are provided comprising a 5 1 sequence which initiates 
5 transcription of alphavirus RNA, a nucleic acid molecule which operably encodes all 
four alphaviral nonstructural proteins, an alphavirus viral junction region promoter, an 
alphavirus polymerase recognition sequence and a 3' polyadenylate tract, wherein said 
alphavirus RNA, upon packaging into an alphavirus panicle and introduction of the 
particle into a mammalian host cell, has a reduced level of vector-specific RNA 

10 svnthesis as compared to wild-type alphavirus particle, and the same or greater level of 
protein encoded by RNA transcribed from the viral junction region promoter, as 
compared to a wild-type alphavirus particle. 

Within another embodiment, such RNA vector replicons further 
comprise a selected heterologous sequence downstream of and operably linked to a viral 

15 junction region. Within further aspects of the invention, host cells are provided which 
contain one of the RNA vector replicons described herein. Within additional aspects of 
the invention, pharmaceutical compositions are provided comprising RNA vector 
replicons as described above and a pharmaceutically acceptable carrier or diluent. 

Within other aspects of the invention, recombinant alphavirus panicles 

20 are provided, comprising one or more alphavirus structural proteins, a lipid envelope, 
and an RNA vector replicon as described herein. Within one embodiment, one or more 
of the alphavirus structural proteins are derived from a different alphavirus than the 
alphavirus from which the RNA vector replicon was derived. Within other 
embodiments, the alphavirus structural protein and lipid envelopes are derived from 

25 different species. Within further aspects, pharmaceutical compositions are provided 
comprising a recombinant alphavirus particle as disclosed above and a pharmaceutically 
acceptable carrier or diluent. Further, mammalian cells infected with such recombinant 
alphavirus particles are also provided. 

Within certain embodiments of the invention, the above described 

30 vectors or particles may further comprise a resistance marker which has been fused, in- 
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frame, with the heterologous sequence. Representative examples of such resistance 
markers include hygromycin phosphotransferase and neomycin phosphotransferase. 

Within other aspects of the present invention, methods are provided for 
selecting alphavirus or recombinant alphavirus vector variants which exhibit the 
5 phenotype described herein of reduced, delayed, or, no inhibition of host cell directed 
macromolecular synthesis. Representative examples of such methods include the use of 
selectable drug or antigenic markers and are provided in more detail below in Example 
~) 

Within other aspects of the present invention. Togavirus capsid panicles 

10 are provided which contain substantially no genomic (i.e., wild-type virus genome) or 
RNA vector replicon nucleic acids. Representative examples of Togaviruses include, 
for example alphaviruses and rubi viruses (e.g., rubella). Within certain embodiments, 
the capsid panicles funher comprise a lipid envelope containing one or more alphavirus 
glycoproteins. Within other embodiments, the capsid panicle further comprises an 

15 alphavirus envelope (i.e., the lipid bilayer and the glycoprotein complement). Within 
related aspects of the present invention, pharmaceutical compositions are provided 
comprising the above noted capsid panicles (with or without a lipid bilayer (e.g., viral 
envelope containing alphavirus glycoproteins)) along with a pharmaceutically 
acceptable carrier or diluent. Within funher aspects, such capsid panicles (with or 

20 without a lipid bilayer (e.g., viral envelope containing alphavirus glycoproteins)) or 
pharmaceutical compositions may be utilized as a vaccinating agent in order to induce 
an immune response against a desired togavirus. 

Within further aspects of the invention, inducible promoters are provided 
comprising a core RNA polymerase promoter sequence, an operably linked nucleic acid 

25 sequence that directs the DNA binding of a protein that activates transcnption from the 
core promoter sequence, and an operably linked nucleic acid sequence that directs the 
DNA binding of a protein that represses transcription from the core promoter sequence. 
Such promoters may be utilized in the gene delivery vehicles described herein, as well 
as a wide variety of other vectors known to those skilled in the art. 
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Within other aspects, alphavirus structural protein expression cassettes 
are provided comprising a 5' promoter which initiates synthesis of viral RNA from 
DNA, a nucleic acid molecule which encodes one or more functional alphavirus 
structural proteins, a selectable marker operably linked to transcription of the expression 
5 cassette, and optionally, a 3' sequence which controls transcription termination. Within 
one embodiment, such expression cassettes further comprise a 5' sequence which 
initiates transcription of alphavirus RNA, a viral junction region promoter, and an 
alphavirus RNA polymerase recognition sequence. Within another embodiment the 
expression cassette further comprises a catalytic ribozyme processing sequence, post- 
10 translational transcriptional regulatory elements which facilitate RNA export from the 
nucleus, and/or elements which permit translation of multicistronic mRNA. selected 
from the group consisting of Internal Ribosome Entry Site elements, elements 
promoting ribosomal read through and BiP sequence. Within other embodiments, the 
selectable marker is operably linked to a 5' promoter capable of initiating synthesis of 
15 alphavirus RNA from cDNA. Within further embodiments, the selectable marker is 
positioned downstream from a junction region promoter and from the nucleic acid 
molecule which encodes alphavirus structural proteins. Within yet other embodiments, 
the 5 1 promoter is an inducible promoter as described herein. Within another 
embodiment, the alphavirus structural protein expression cassette further comprises an 
20 alphavirus capsid protein gene or other sequence (e.g., a tobacco etch virus or "TEV" 
ieader) which is capable of enhancing translation of one or more functional alphavirus 
structural protein genes located 3' to the enhancer sequence. Preferably, the capsid 
protein gene sequence is derived from a different alphavirus than that from which the 
sequence encoding the alphavirus structural genes is obtained. 
25 Within yet other aspects of the invention, alphavirus packaging cell lines 

are provided comprising a cell containing an alphavirus structural protein expression 
cassette as described above. In certain embodiments, the alphavirus packaging cell 
lines are stably transformed with the alphavirus structural protein expression cassettes 
provided herein. Within related aspects, alphavirus producer cell lines are provided 
30 comprising a cell which contains a stably transformed alphavirus structural protein 
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expression cassette, and a vector selected from the group consisting of RNA vector 
replicons, alphavirus vector constructs and eukaryotic layered vector initiation systems. 

Within yet other aspects of the present invention, eukaryotic layered 
vector initiation systems are provided, comprising a 5' promoter capable of initiating 
5 in vivo the 5' synthesis of RNA from cDNA, a sequence which initiates transcription of 
alphavirus RNA following the 5' promoter: a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins, including an isolated nucleic acid 
molecule as discussed above, an alphavirus RNA polymerase recognition sequence, and 
a 3' polyadenylate tract. 

10 Also provided are eukaryotic layered vector initiation systems 

comprising a 5' promoter capable of initiating in vivo the 5' synthesis of alphavirus 
RNA from cDNA. a sequence which initiates transcription of alphavirus RNA 
following the 5' promoter, a nucleic acid molecule which operably encodes all four 
alphaviral nonstructural proteins, an alphavirus RNA polymerase recognition sequence, 

15 and a 3' polyadenylate tract, wherein the in vivo synthesized RNA. upon packaging into 
an alphavirus particle and introduction of the panicle into a mammalian host cell, 
increases the time required to reach 50% inhibition of host-cell directed macromolecular 
synthesis following expression in mammalian cells, as compared to a wild-type 
alphavirus panicle. 

20 Related eukaryotic layered vector initiation system are also provided 

which comprise a 5' promoter capable of initiating in vivo the 5' synthesis of alphavirus 
RNA from cDNA. a sequence which initiates transcription of alphavirus RNA 
following the 5' promoter, a nucleic acid molecule which operably encodes all four 
alphaviral nonstructural proteins, an alphavirus RNA polymerase recognition sequence. 

25 and a 3' polyadenylate tract, wherein said in vivo synthesized RNA, upon packaging 
into an alphavirus particle and introduction of the particle into a mammalian host cell, 
has a reduced level of vector-specific RNA synthesis as compared to wild-type 
alphavirus particle, and the same or greater level of protein encoded by RNA 
transcribed from the viral junction region promoter, as compared to a wild-type 

30 alphavirus particle. 
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Representative examples of suitable 5' promoters for eukaryotic layered 
vector initiation systems include RNA polymerase I promoters, RNA polymerase II 
promoters. RNA polymerase III promoters, the HSV-TK promoter. RSV promoter, 
tetracycline inducible promoter, MoMLV promoter, a SV40 promoter and a CMV 
5 promoter. Within preferred embodiments, the 5 ! promoter is an inducible promoter as 
described herein. 

Within certain embodiments, eukaryotic layered vector initiation systems 
are provided which further comprise a heterologous sequence operably linked to a viral 
junction region, and/or a post-transcriptional regulatory element which facilitates RNA 
0 export from the nucleus. Within further embodiments, the eukaryotic layered vector 
initiation svstems provided herein may further comprise a transcription termination 
signal. 

Within related aspects, the present invention also provides host cells 
(e.g., vertebrate or insect) containing a stably transformed eukaryotic layered vector 

5 initiation system as described above. Within further aspects of the present invention, 
methods for delivering a selected heterologous sequence to a vertebrate or insect are 
provided, comprising the step of administering to a vertebrate or insect an alphavirus 
vector construct, RNA vector replicon. recombinant alphavirus particle, or a eukaryotic 
layered vector initiation system as described herein. Within certain embodiments, the 

0 aiphavirus vector construct. RNA vector replicon. recombinant alphavirus particle or 
eukaryotic layered vector initiation system is administered to cells of the vertebrate 
ex vivo, followed by administration of the vector or particle-containing cells to a warm- 
blooded animal. 

Within other aspects, pharmaceutical compositions are provided 
5 comprising a eukaryotic layered vector initiation system as discussed above, and a 
pharmaceutically acceptable carrier or diluent. Within certain embodiments, the 
pharmaceutical composition is provided as a liposomal formulation. 

Within further aspects, methods of making recombinant alphavirus 
particles are provided, comprising the steps of (a) introducing a vector such as a 
0 eukaryotic layered vector initiation system, RNA vector replicon, or alphavirus vector 
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panicle as described above into a population of packaging cells under conditions and 
for a time sufficient to permit production of recombinant alphavirus particles, and 
(b) harvesting recombinant alphavirus panicles. Within related aspects, methods of 
making a selected protein are provided, comprising the steps of (a) introducing a vector 
5 which encodes a selected heterologous protein, such as a eukaryotic layered vector 
initiation system. RNA vector replicon or alphavirus vector panicle described above, 
into a population of packaging cells, or other cells under conditions and for a time 
sufficient to permit production of the selected protein, and (b) harvesting protein 
produced by the vector containing cells. Within yet other aspects, methods of making a 

10 selected protein are provided, comprising the step of introducing a eukaryotic layered 
vector initiation system which is capable of producing a selected heterologous protein 
into a host cell, under conditions and for a time sufficient to permit expression of the 
selected protein. Within funher aspects, host cell lines are provided which contain a 
RNA vector replicon as described herein. 

15 Within yet other aspects of the present invention, alphavirus vaccines are 

provided, comprising one of the above-described alphavirus vector constructs, RNA 
vector replicons. eukaryotic vector initiation systems, or recombinant alphavirus 
panicles, which may or may not express one of the heterologous sequences provided 
herein (e.g., they may be utilized solely as a vaccine for treating or preventing 

20 alphaviral diseases). For example, within one embodiment of the invention, 
recombinant togavirus panicles are provided which have substantially no nucleic acid 
or RNA vector replicon nucleic acid. Within a funher embodiment, recombinant 
togavirus particles are provided which contain heterologous viral nucleic acids (i.e., 
from a different virus than the togavirus particle). Within yet another embodiment, the 

25 recombinant togavirus particle is T=3 or greater. 

Within further aspects of the invention, recombinant chimeric togavirus 
particles (either empty, or containing nucleic acids) are provided wherein the viral 
particle has viral structural components obtained or derived from different Togaviridae 
(e.g., the capsid protein and glycoprotein is obtained from different alphavirus sources). 
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Within other aspects of the invention, methods for stimulating an 
immune response within a vertebrate are provided, comprising the step of administering 
to a vertebrate an alphavirus vector construct, an alphavirus RNA vector replicon 
according, a recombinant alphavirus particle, or a eukaryotic layered vector initiation 
5 system, wherein the alphavirus vector construct, RNA vector replicon, particle, or 
eukaryotic layered vector initiation system expresses an antigen which stimulates an 
immune response within said vertebrate (see, e.g., U.S. Serial No. 08/404,796 for 
suitable antigens!. Within related aspects, methods are provided for inhibiting a 
pathogenic agent within a vertebrate, comprising the step of administering to a 

10 vertebrate an alphavirus vector construct, an alphavirus RNA vector replicon. a 
recombinant alphavirus particle, or a eukaryotic layered vector initiation system 
according, wherein said alphavirus vector construct, RNA vector replicon. particle, or 
eukaryotic layered vector initiation system expresses an palliative which is capable of 
inhibiting a pathogenic agent (see, e.g., U.S. Serial No. 08/404,796 for suitable 

15 palliatives). 

These and other aspects and embodiments of the invention will become 
evident upon reference to the following detailed description and attached figures. In 
addition, various references are set forth herein that describe in more detail certain 
procedures or compositions (e.g., plasmids. sequences, etc.), and are therefore 
20 incorporated by reference in their entirety as if each were individually noted for 
incorporation. 

Brief Description o f the Figures 

Figure 1 is a schematic illustration of Sindbis virus and general 
25 alphavirus genomic organization and replication strategy. 

Figure 2 is a graph of virus release from BHK cells infected at an MOI 
of 10 with SIN-1, SIN-l/nsPl-4, TotollOl, or Sin-l/nsP2 viruses. Cell culture fluids 
were collected at 3, 6, 9 and 12 hours post-infection. Virus titers were determined by 
plaque assay. 
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Figure 3 is a graph depicting viral RNA synthesis in BHK ceils 
following infection by TotollOI, SIN-l/nsP2 ? SIN-l/nsPl-4, or SIN-1 virus. Cells 
were infected at an MOI 10 and at 1 hour post-infection, actinomycin D and J H-uridine 
were added. At 3. 6, 9, and 12 hpi the amount of 3 H-uridine incorporation was 
5 determined. 

Figure 4 is a graph depicting viral RNA synthesis in BHK cells infected 
by SIN-l/nsPl. SrN-l/nsP2 ; SIN-l/nsP3. SIN-l/nsP3-4, SIN-l/nsP4, SIN-l/nsP2-C, 
SIN-l/nsP2-N, TotollOI, SIN-1. or SIN-l/nsPl-4. The levels of 3 H-uridine 
incorporation are expressed relative to wild-type (Toto 1 101) infection. 
10 Figure 5 is a graph depicting the shut-off of host cell protein synthesis in 

BHK cells infected by SrN-InsPl-4. SIN-1, SIN-lnsP2, or TotollOI viruses. 

Figures 6A-6D is the cDNA sequence of 8000 bases of SIN-1 virus 
(SEQ. ID NO. 101). 

Figure 7A-7D is the cDNA sequence of 8000 bases of SINCG virus 

15 (SEQ. ID NO. 102). 

Figures SA-SE are the cDNA sequence of Toto 1 101 virus (SEQ. ID NO. 

103). 

Figure 8F is a schematic illustration depicting selection of vectors 
expressing the desired phenotype using a selectable marker. 
20 Figure 8G is a northern blot analysis of RNAs isolated from G418- 

resistant BHK-21 cell pools stably transformed with a variant Sindbis virus vector or 
Semliki Forest virus vector expressing neomycin phosphotransferase. 

Figure 8H is a schematic illustration of the genetic determinants 
responsible for the desired phenotype in variant Sindbis virus vectors. 
25 Figure 9A is a northern blot analysis of RNAs isolated from BHK-21 

ceils that were transfected with pBG/SIN-1 ELVS 1.5-SEAP or pBG/wt ELVS 1.5- 
SEAP plasmid DNAs, and hybridized with a radiolabeled viral RNA probe. 

Figure 9B is a graph depicting a 7 day timecourse of alkaline 
phosphatase expression in BHK cells transfected with pBG/SIN-I ELVS 1.5-SEAP or 
30 pBG/wt ELVS 1 .5-SEAP plasmid DNAs. 
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Figure 10 is a graph depicting a 4 day timecourse of luciferase 
expression in BHK cells transfected with pBG/STN-1 ELVS 1.5-luc or pBG/wt ELVS 
1.5-luc plasmid DNAs. 

Figure 1 1 A is a northern blot analysis of RNAs isolated from BHK-21 
5 cells that were transfected with pBG/SIN-1 ELVS-1 .5-P-gal or pBG/wt ELVS 1. 5-P-gal 
plasmid DNAs, and hybridized with a radiolabeled viral RNA probe. 

Figure 1 IB is a western blot analysis detecting p-gal expression in BHK- 
21 cells transfected with either pBG/SIN-I ELVS-1 .5-P-gal or pBG/wt ELVS 1.5-p-gal 
plasmid DNAs. 

0 Figure 11C is a graph depicting a 5 day timecourse of alkaline 

phosphatase expression in BHK cells transfected with pBG/SrN-l ELVS-1 .5-p-gal or 
pBG/wt ELVS 1. 5-P-gal plasmid DNAs. 

Figure 12A & B are graphs depicting p-gal expression in HT10S0 and 
BHK-21 cells transfected with ELVS p-gal vectors with or without HBV PRE 
5 sequences, as measured by RLU (relative light units). 

Figure 13 is a schematic illustration of RNA amplification, structural 
protein expression, and vector packaging by vector inducible alphavirus packaging cell 
lines. 

Figure 14 is a schematic illustration of vector inducible structural protein 
0 expression cassettes used in the generation of alphavirus packaging cell lines. 

Figure 15 is a graph depicting luciferase vector packaging (transfer of 
expression) by different alphavirus packaging cell lines. 

Figure 16 is a western blot analysis demonstrating induction of structural 
protein expression by an alphavirus packaging cell line following transfection and 
5 subsequent expression with an alphavirus vector (ELVS-Pgal), but not a conventional 
plasmid DNA expression vector (pCMV-pgal). 

Figure 17A is a graph depicting luciferase vector packaging by C6/36 
mosquito cells containing the pDCMV-intSINrbz structural protein expression cassette. 

Figure 17B is a graph depicting luciferase vector packaging by human 
0 293 packaging cells stably transformed with plasmid pBGSVCMVdlneo. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



17 

Figure 18 is a graph depicting luciferase vector packaging by different 
alphavirus packaging cell lines. 

Figure 19 is an RNA gel autoradiograph depicting "H uridine-labeled 
RNAs from BHK cells infected with SINrep/LacZ vector particles produced from an 
5 alphavims packaging cell line. 

Figure 20 is a protein gel autoradiograph depicting 35 S methionine- 
labeled proteins from BHK cells infected with SINrep/LacZ vector particles produced 
from an alphavirus packaging cell line. 

Figure 21 A is a schematic illustration depicting packaging of alphavirus 
10 vectors with structural proteins in which the capsid protein and glycoproteins are 
expressed from distinct, or "split", expression cassettes. 

Figure 21 B is a schematic illustration depicting the structural protein 
expression cassettes in which the capsid protein and glycoproteins are separated, used to 
derive stable split structural gene packaging cell lines. 
15 Figure 21C is a western blot analysis demonstrating induction of 

alphavirus capsid protein synthesis by several clonal cell lines following transfection 
and subsequent synthesis of alphavirus glycoproteins or Pga! from an alphavirus 
expression vector (EL VS-1.5PE [1.5 PE], or ELVS-Pgai [Pgal]). 

Figure 21 D is a western blot analysis demonstrating induction of 
20 alphavirus capsid protein synthesis by several clonal cell lines following transfection 
and subsequent expression with an alphavirus expression vector (ELVS-pgai). 

Figure 22 is a schematic illustration of the region of structural protein 
expression cassettes comprising a wild-type or deletion mutant Ross River virus capsid 
protein gene. 

25 Figure 23 is a schematic illustration of vector inducible structural protein 

expression cassettes containing a wild-type or deletion mutant Ross River virus capsid 
protein gene. 

Figure 24 is a schematic illustration of vector packaging by "split" 
structural protein gene expression cassettes which contain a Ross River virus capsid 
30 protein gene sequence upstream of the Sindbis virus glycoprotein genes. 
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Figure 25 is a schematic illustration of vector packaging by "split" 
structural protein gene expression cassettes which contain a Ross River virus capsid 
protein gene sequence upstream of the Sindbis virus glycoprotein genes on one cassette, 
and the Sindbis virus capsid protein gene in a separate cassette. 
5 Figure 26A is a table showing the results of vector particle packaging 

using the above "split" structural protein gene expression cassettes. 

Figure 26B is two graphs which depict the packaging activity of 25 
clonal cell lines from drug-resistant cell pools derived by stable transfection with the 
split structural protein gene expression cassettes illustrated in Figure 2 IB. relative to a 
10 genomic structural protein gene PCL (987 dlneoj. 

Figure 26C is a western blot analysis demonstrating induction of 
structural protein expression by three split structural gene alphavirus packaging cell 
lines following transfection and subsequent expression with an alphavirus vector 
(ELVS-pgal), but not a conventional plasmid DNA expression vector (pCI-Pgal). 
15 Figure 26D is a graph depicting the amplification and production of [3- 

gal protein over time in several split structural gene alphavirus packaging cell lines 
(Clone 9TD, Clone 2TD, Clone 24TD, Clone 20SS), relative to a genomic structural 
protein gene PCL (987 genomic PCL). 

Figure 27 is a schematic illustration of the use of alphavirus packaging 
20 cell lines for the amplification of packaged vector particle preparations and the large 
scale production of recombinant protein. 

Figure 28 is a graph depicting the amplification and production of (3-gal 
protein over time using alphavirus packaging cell lines. 

Figure 29 is a schematic illustration of the use of a tetracycline regulated 
25 promoter system to control expression of alphavirus vector RNA from cDNA in vivo. 

Figure 30 is a schematic illustration of the use of a linked transcriptional 
repressor and a transcriptional inducer/activator regulated promoter system to control 
expression of alphavirus vector RNA from cDNA in vivo. 

Figure 31 A & B are autoradiography of [ J H]uridine-labeled RNAs 
30 electrophoresed on denaturing glyoxal gels that were isolated from BHK cell 
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electroporated with SfNrep/LacZ replicon and DH RNAs from various RRV capsid 
containing DH constructs, and from the vector panicles present in the culture fluids at 
1 8 hours post electroporation. 

Figure 32A &l B are protein gel autoradiographs depicting "S 
5 methionine-labelled proteins from BHK cells electroporated with SINrep/LacZ repiicon 
and DH RNAs from various RRV capsid containing DH constructs, and from the vector 
particles present in the culture fluids at 18 hours post electroporation. 

Figure 33A-D are Kyte-Doolittle hydrophobicity plots of various Ross 
River virus (RRV) capsid proteins, expressed from the wild-type gene (A) and three 
10 deletion mutants CAlrrw CA2rrv, and CA3rrv (B-D. respectively). 

Figure 34 is a schematic that illustrates the amino-terminus RRV capsid 
proteins expressed from the wild-type gene (SEQ. ID NO. 114), and three deletion 
mutants CAlrrv (SEQ. ID NO. 115), CA2rrv (SEQ. ID NO. 1 16), and CA3rrv (SEQ. ID 
NO. 1 1 7). The lysine residues deleted in the RRV capsid gene mutants are indicated. 
15 Figure 35 is a graph that illustrates the relative levels of [ j3 S]methionine 

and [ 3 H]uridine incorporated into virus particles in BHK cells infected at high MOI with 
TotollOl wild-type virus. 

Figure 36 is a graph that illustrates the relative levels of [ 33 S] methionine 
and [ 3 H]uridine incorporated into virus panicles in BHK cells electroporated with 
20 SINrep/lacZ and DH-BB (5' tRNA) Cm' DH RNAs. 

Figure 37 is a graph that illustrates the relative levels of [ 35 S]methionine 
and [ 3 H]uridine incorporated into virus panicles in BHK cells electroporated with 
SINrep/lacZ and RRV capsid deletion mutant DH-BB (5' tRNA) CA3rrv DH RNAs, 

Figure 38 is a graph which compiles the results shown in Figures 35-37, 
25 depicting the relative levels of [ 35 S]methionine and [ 3 H]uridine incorporated into virus 
panicles in BHK cells electroporated with SINrep/lacZ and DH RNAs, or infected with 
Totol 101 wild-type vims. 

Figure 39 is a graph which illustrates luciferase vector packaging by 
BHK cells stably transformed with pBGSVCMVdlhyg. 

30 
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Definition of Terms 

The following terms are used throughout the specification. Unless 
otherwise indicated, these terms are defined as follows: 

" Genomic RNA " refers to RNA which contains all of the genetic 
5 information required to direct its own amplification or self-replication in vivo, within a 
target ceil. To direct its own replication, the RNA molecule may: 1) encode one or 
more polymerase, replicase. or other proteins which may interact with viral or host cell- 
derived proteins, nucleic acids or ribonucleoproteins to catalyze the RNA amplification 
process; and 2) contain cis RNA sequences required for replication, which may be 

10 bound during the process of replication by its self-encoded proteins, or non-self- 
encoded cell-derived proteins, nucleic acids or ribonucleoproteins, or complexes 
between any of these components. An alphavirus-denved genomic RNA molecule 
should contain the following ordered elements: 5' viral or defective-interfering RNA 
sequence(s) required in as for replication, sequences which, when expressed, code for 

15 biologically active aiphavirus nonstructural proteins {e.g., nsPl, nsP2, nsP3, nsP4), 3* 
viral sequences required in cis for replication, and a polyadenylate tract. The 
alphavirus-derived genomic RNA vector replicon also may contain a viral subgenomic 
"junction region" promoter which may, in certain embodiments, be modified in order to 
prevent, increase, or reduce viral transcription of the subgenomic fragment, and 

20 sequences which, when expressed, code for biologically active aiphavirus structural 
proteins (e.g., C. E3, E2. 6K. El). Generally, the term genomic RNA refers to a 
molecule of positive polarity, or "message" sense, and the genomic RNA may be of 
length different from that of any known, naturally-occurring aiphavirus. In preferred 
embodiments, the genomic RNA does not contain the sequences which encode any 

25 alphaviral structural protein(s); rather those sequences are substituted with heterologous 
sequences. In those instances where the genomic RNA is to be packaged into a 
recombinant aiphavirus particle, it must contain one or more sequences which serve to 
initiate interactions with aiphavirus structural proteins that lead to particle formation, 
and preferably is of a length which is packaged efficiently by the packaging system 

30 being employed. 
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" ^genomic RNA ". or "26S" RNA, refers to a RNA molecule of a 
length or size which is smaller than the genomic RNA from which it was derived. The 
subgenomic RNA should be transcribed from an internal promoter whose sequences 
reside within the genomic RNA or its complement. Transcription of the subgenomic 
5 RNA may be mediated by viral-encoded polymerase^), host cell-encoded 
polymerase(s), transcription factor(s), nbonucleoprotein(s), or a combination thereof. 
In preferred embodiments, the subgenomic RNA is produced from a vector according to 
the invention, and encodes or expresses the gene(s) or sequence(s) of interest. The 
subcenomic RNA need not necessarily have a sedimentation coefficient of 26. 

10 " Alphavirus vector construct " refers to an assembly which is capable of 

directing the expression of a sequence(s) or gene(s) of interest. Such vector constructs 
are comprised of a 5' sequence which is capable of initiating transcription of an 
alphavirus RNA (also referred to as 5' CSE, in background), as well as sequences 
which, when expressed, code for biologically active alphavirus nonstructural proteins 

15 (e.g., nsPl, nsP2. nsP3, nsP4), and an alphavirus RNA polymerase recognition 
sequence (also referred to as 3' CSE, in background). In addition, the vector construct 
should include a viral subgenomic "junction region" promoter which may, in certain 
embodiments, be modified in order to prevent, increase, or reduce viral transcription of 
the subgenomic fragment, and also a polyadenylate tract. The vector also may include 

20 sequences from one or more structural protein genes or portions thereof, extraneous 
nucleic acid moleculefs) which are of a size sufficient to allow production of viable 
virus, a 5' promoter which is capable of initiating the synthesis of viral RNA in vitro 
from cDNA, a heterologous sequence to be expressed, as well as one or more restriction 
sites for insertion of heterologous sequences. 

25 " Alphavirus RNA vector replicon ". " RNA vect or replicon" and 

" replicon " refers to a RNA molecule which is capable of directing its own amplification 
or self-replication in vivo, within a target cell. To direct its own amplification, the RNA 
molecule may: 1) encode one or more polymerase, replicase, or other proteins which 
may interact with viral or host cell-derived proteins, nucleic acids or ribonucieoproteins 

30 to catalyze RNA amplification; and 2) contain cis RNA sequences required for 
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replication which may be bound by its self-encoded proteins, or non-self-encoded cell- 
derived proteins, nucleic acids or ribonucleoproteins. or complexes between any of 
these components. In certain embodiments, the amplification also may occur in vitro. 
An alphavirus-derived RNA vector replicon molecule should contain the following 
5 ordered elements: 5' viral sequences required in cis for replication (also referred to as 5' 
CSE, in background), sequences which, when expressed, code for biologically active 
alphavirus nonstructural proteins {e.g., nsPl, nsP2, nsP3, nsP4) ; 3' viral sequences 
required in cis for replication (also referred to as 3' CSE, in background), and a 
polyadenylate tract. The alphavirus-derived RNA vector replicon also may contain a 

10 viral subgenomic "junction region" promoter which may, in certain embodiments, be 
modified in order to prevent, increase, or reduce viral transcription of the subgenomic 
fragment, sequences from one or more structural protein genes or portions thereof, 
extraneous nucleic acid molecule(s) which are of a size sufficient to allow production of 
viable virus, as well as heterologous sequence(s) to be expressed. The source of RNA 

15 vector replicons in a cell may be from infection with a virus or recombinant alphavirus 
particle, or transfection of plasmid DNA or in vitro transcribed RNA. 

" Recombinant Alphavirus Particle " refers to a virion unit containing an 
alphavirus RNA vector replicon. Generally, the recombinant alphavirus particle 
comprises one or more alphavirus structural proteins, a lipid envelope and an RNA 

20 vector replicon. Preferably, the recombinant alphavirus panicle contains a nucleocapsid 
structure that is contained within a host cell-derived lipid bilayer. such as a plasma 
membrane, in which alphaviral-encoded envelope glycoproteins are embedded. The 
particle may also contain other components (e.g., targeting elements such as biotin, 
other viral structural proteins, or other receptor binding ligands) which direct the 

25 tropism of the panicle from which the alphavirus was derived, or other RNA molecules. 

" Stnictural protein expression cassette " refers to a nucleic acid molecule 
which is capable of directing the synthesis of one or more alphavirus structural proteins. 
The expression cassette should include a 5' promoter which is capable of initiating 
in vivo the synthesis of RNA from cDNA, as well as sequences which, when expressed, 

30 code for one or more biologically active alphavirus structural proteins {e.g., C, E3, E2, 
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6K. El), and a 3' sequence which controls transcription termination. The expression 
cassette also may include a 5' sequence which is capable of initiating transcription of an 
alphavirus RNA (also referred to as 5' CSE, in background), a viral subgenomic 
"junction region" promoter, and an alphavirus RNA polymerase recognition sequence 
5 (also referred to as 3' CSE, in background). In certain embodiments, the expression 
cassette also may include splice recognition sequences, a catalytic ribozyme processing 
sequence, a sequence encoding a selectable marker, a nuclear export signal as well as a 
polyadenyiation sequence. In addition, expression of the alphavirus structural protein(s) 
mav, in certain embodiments, be regulated by the use of an inducible promoter. 

10 " Stable Transformation " refers to the introduction of a nucieic acid 

molecule into a living cell, and long-term or permanent maintenance of that nucleic acid 
molecule in progeny cells through successive cycles of cell division. The nucleic acid 
molecule may be maintained in any cellular compartment, including, but not limited to, 
the nucleus, mitochondria, or cytoplasm. In preferred embodiments, the nucleic acid 

15 molecule is maintained in the nucleus. Maintenance may be intrachromosomal 
(integrated) or extrachromosomal, as an episomal event. 

" Alphavirus packaging cell line " refers to a cell which contains an 
alphavirus structural protein expression cassette and which produces recombinant 
alphavirus particles after introduction of an alphavirus vector construct. RNA vector 

20 repiicon. eukaryotic layered vector initiation system, or recombinant alphavirus panicle. 
The parental cell may be of mammalian or non-mammalian origin. Within preferred 
embodiments, the packaging cell line is stably transformed with the structural protein 
expression cassette. 

" Alphavirus producer cell line " refers to a cell line whicfi is capable of 

25 producing recombinant alphavirus particles, comprising an alphavirus packaging cell 
line which also contains an alphavirus vector construct, RNA vector repiicon, 
eukaryotic layered vector initiation system, or recombinant alphavirus particle. 
Preferably, the alphavirus vector construct is eukaryotic layered vector initiation 
system, and the producer cell line is stably transformed with the vector construct. In 

30 preferred embodiments, transcription of the alphavirus vector construct and subsequent 
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production of recombinant alphavirus panicles occurs only in response to one or more 
factors, or the differentiation state of the alphavirus producer cell line. 

" Defective helper construct " refers to an assembly which is capable of 
RNA amplification or replication, and expression of one or more alphavirus structural 
5 proteins in response to biologically active alphavirus nonstructural proteins supplied in 
trans. The defective helper construct should contain the following ordered elements: 5' 
viral or defective-interfering RNA sequences required in cis for replication, a viral 
subeenomic junction region promoter, sequences which, when expressed, code for one 
or more biologically active alphavirus structural proteins (e.g., C. E3 ; E2, 6fC, El), 3' 

10 viral sequences required in cis for replication, and a polyadenylate tract. The defective 
helper construct also may contain a 5' promoter which is capable of initiating the 
synthesis of viral RNA from cDNA, a 3' sequence which controls transcription 
termination, splice recognition sequences, a catalytic ribozyme processing sequence, a 
sequence encoding a selectable marker, and a nuclear export signal. 

15 " Eukarvotic Layered Vector Initiation System " refers to an assembly 

which is capable of directing the expression of a sequence(s) or gene(s) of interest. The 
eukarvotic layered vector initiation system should contain a 5' promoter which is 
capable of initiating in vivo (i.e. within a cell) the synthesis of RNA from cDNA, and a 
nucleic acid vector sequence which is capable of directing its own replication in a 

20 eukarvotic cell and also expressing a heterologous sequence. The nucleic acid sequence 
which is capable of directing its own amplification may be of viral or non-viral origin. 
In certain embodiments, the nucleic acid vector sequence is an alphavirus-denved 
sequence and is comprised of a 5' sequence which is capable of initiating transcription 
of an alphavirus RNA (also referred to as 5' CSE, in background), as well as sequences 

25 which, when expressed, code for biologically active alphavirus nonstructural proteins 
(e.g., nsPl, nsP2. nsP3, nsP4), and an alphavirus RNA polymerase recognition 
sequence (also referred to as 3' CSE, in background). In addition, the vector sequence 
may include a viral subgenomic "junction region" promoter which may, in certain 
embodiments, be modified in order to prevent, increase, or reduce viral transcription of 

30 the subgenomic fragment, sequences from one or more structural protein genes or 
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portions thereof, extraneous nucleic acid molecule(s) which are of a size sufficient to 
allow optimal amplification, a heterologous sequence to be expressed, one or more 
restriction sites for insertion of heterologous sequences, as well as a polyadenylation 
sequence. The eukaryotic layered vector initiation system may also contain splice 
5 recognition sequences, a catalytic ribozyme processing sequence, a nuclear export 
signal, and a transcription termination sequence. In certain embodiments, in vivo 
synthesis of the vector nucleic acid sequence from cDNA may be regulated by the use 
of an inducible promoter. 

" Alphavirus cDNA vector construct " refers to an assembly which is 

10 capable of directing the expression of a sequence(s) or gene(s) of interest. The vector 
construct is comprised of a 5' sequence which is capable of initiating transcnption of an 
alphavirus RNA (also referred to as 5' CSE), as well as sequences which, when 
expressed, code for biologically active alphavirus nonstructural proteins (e.g., nsPl, 
nsP2, nsP3, nsP4). and an alphavirus RNA polymerase recognition sequence (also 

1 5 referred to as 3' CSE. in background). In addition, the vector construct should include a 
5' promoter which is capable of initiating in vivo the synthesis of viral RNA from 
cDNA, and a 3' sequence which controls transcription termination. Within certain 
embodiments, the vector construct may further comprise a viral subgenomic "junction 
region" promoter which may, in certain embodiments, be modified in order to prevent, 

20 increase, or reduce viral transcription of the subgenomic fragment The vector also may 
include sequences from one or more structural protein genes or portions thereof, 
extraneous nucleic acid molecule(s) which are of a size sufficient to allow production of 
viable virus, a heterologous sequence to be expressed, one or more restriction sites for 
insertion of heterologous sequences, splice recognition sequences, a catalytic ribozyme 

25 processing sequence, a nuclear export signal, as well as a polyadenylation sequence. In 
certain embodiments, in vivo synthesis of viral RNA from cDNA may be regulated by 
the use of an inducible promoter. 

" Gene delivery vehicle " refers to a construct which can be utilized to 
deliver a gene or sequence of interest. Representative examples include alphavirus 
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RNA vector replicons. alphavirus vector constructs, eukaryotic layered vector initiation 
systems and recombinant alphavirus particles. 

Numerous aspects and advantages of the invention will be apparent to 
those skilled in the art upon consideration of the following detailed description which 
5 provides illumination of the practice of the invention. 

Detailed Description of the Invention 

As noted above, the present invention provides novel gene delivery 
vehicles including for example, RNA vector replicons, alphavirus vector constructs, 

10 eukarvotic lavered vector initiation systems and recombinant alphavirus panicles. 
Briefly, introduction of ptasmid DNA-. in vitro transcribed RNA- ; or particle-based 
vectors of the present invention into a cell, results in levels of heterologous gene 
expression that are equivalent, or higher, as compared to expression levels of wild-type 
derived alphaviral vectors. Unexpectedly however, the level of vector-specific RNA 

15 synthesized is at least about 5 to 10-fold lower in cultured cells which contain a gene 
deliver/ vehicle of the present invention, as compared to wild-type derived vectors. 
Furthermore, such gene delivery vehicles exhibit reduced, delayed, or no inhibition of 
host cell-directed macromoiecuiar synthesis following introduction into a host cell, as 
compared to wild-type derived vectors. 

20 As discussed in more detail below, the present invention provides: 

(A) sources of wild^type alphaviruses suitable for constructing the gene deliver)' 
vehicles of the present invention; (B) methods for selecting alphaviruses with a desired 
phenotype; (C) construction of alphavirus vector constructs and alphavirus RNA vector 
replicons; (D) construction of Eukaryotic Layered Vector Initiation Systems: 

25 (E) construction of recombinant alphavirus panicles; (F) heterologous sequences which 
may be expressed by the gene delivery vehicles of the present invention; 
(G) construction of alphavirus packaging or producer cell lines; (H) pharmaceutical 
compositions; and (I) methods for utilizing alphavirus-based vectors. 
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A. Sources of Wild-Type Alphavinis 

As noted above, the present invention provides a wide variety of 
alphavirus-based vectors {e.g., RNA vector replicons, alphavinis vector constructs, 
eukaryotic layered vector initiation systems and recombinant alphavinis particles), as 

5 well as methods for utilizing such vector constructs and particles. Briefly, sequences 
encoding wild-type alphaviruses suitable for use in prepanng the above-descnbed 
vectors can be readily obtained given the disclosure provided herein from naiurally- 
occurring sources, or from depositories (e.g., the American Type Culture Collection, 
Rockville ; Maryland). In addition, wild-type alphaviruses may be utilized for 

0 comparing the level of host-cell directed macromolecular synthesis in cells infected 
with the wild-type alphavinis. with the level of host-cell directed macromolecular 
synthesis in cells containing the gene delivery vehicles of the present invention. 

Representative examples of suitable alphaviruses include Aura virus 
(ATCC VR-36S), Bebaru virus (ATCC VR-600, ATCC VR-1240), Cabassou virus 

5 (ATCC VR-922), Chikungunya virus (ATCC VR-64, ATCC VR-1241), Eastern equine 
encephalomyelitis virus (ATCC VR-65, ATCC VR-1242), Fort Morgan virus (ATCC 
VR-924), Getah virus (ATCC VR-369, ATCC VR-1243), Kyzylagach virus (ATCC 
VR-927), Mayaro virus (ATCC VR-66, ATCC VR-1277), Middleburg virus (ATCC 
VR-370), Mucambo virus (ATCC VR-580. ATCC VR-1244), Ndumu virus (ATCC 

0 VR-371), Pixuna virus (ATCC VR-372. ATCC VR-1245). Ross River virus (ATCC 
VR-373, ATCC VR-1246), Semliki Forest virus (ATCC VR-67. ATCC VR-1247), 
Sindbis virus (ATCC VR-68. ATCC VR-1248; see also CMCC #4640 ; described 
below), Tonate virus (ATCC VR-925), Trimti virus (ATCC VR-469), Una virus 
(ATCC VR-374), Venezuelan equine encephalomyelitis virus (ATCC VR-69, ATCC 

5 VR-923, ATCC VR-1250 ATCC VR-1249, ATCC VR-532), Western equine 
encephalomyelitis virus (ATCC VR-70, ATCC VR-1251, ATCC VR-622. ATCC VR- 
1252), Whataroa virus (ATCC VR-926), and Y-62-33 virus (ATCC VR-375). 

For purposes of comparing levels of cellular macromolecular synthesis, 
the following plasmids may also be utilized as a standard source of wild-type aiphavirus 

0 stocks. These plasmids include: for Semliki Forest Virus, pSP6-SFV4 (Liljestrom et 
al., J. Virol. 65:4107-4113, 1991); for Venezuelan equine encephalitis virus, pV2000 
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(Davis et a!., Vir. /t?J:20-3L 1991); for Ross River virus, pRR64 (Kuhn et al., Vir. 
752:430-441. 1991). Briefly, for these plasmids, virus can be obtained from BHK cells 
transfected with in vitro transcribed genomic RNA from the plasmids. For Sindbis 
virus, infectious virus may be isolated directly from BHK cells transfected with 
5 pVGELVIS (ATCC No. 75891) plasmid DNA, or alternatively, obtained as a wild-type 
virus stock (see deposit information provided below regarding ATCC No. VR-2526). 

B. Selection of Alphaviruses With a D esired Phenotvpe 

The duration of in vivo heterologous gene expression from alphavirus- 

10 based vectors is affected by several mechanisms, including inhibition of host cell- 
directed macromolecular synthesis. However, prior to the present invention, there had 
been no obvious method to select for or identify coding or non-coding vector viral- 
specific sequence changes that result in a non-cytopathic phenotvpe. Therefore, within 
one aspect of the present invention methods are provided for isolating and/or 

15 constructing alphavirus-derived gene delivery vehicles with reduced or no inhibition of 
host cell directed macromolecular synthesis. 

1. Biological Selection of Virus Variants 

a. Selection from Virus Stocks Contai ning PI Panicles 

20 One approach for isolating non-cytopathic alphavirus variants exploits 

the presence of defective interfering (DI) panicles in wild-type virus preparations. 
Brieflv, although cenain RNA viruses, for example rhabdoviruses (e.g., vesicular 
stomatitis virus) and alphaviruses (e.g., Sindbis virus and Semliki Forest virus), are 
highly cytopathic, they can nevertheless establish long-term persistent infection in 

25 cultured cells in the presence of DI panicles. DI panicles, by definition, are derived 
from wild-type virus and contain one or more mutations (e.g., deletions, 
rearrangements, nucleotide substitutions, etc.) from the wild-type genome which 
prevent autonomous replication by the DI. In general, the genome of DI panicles is 
smaller and of a lower complexity compared to wild-type virus, and is deleted of 

30 protein-encoding regions while maintaining regions required in cis for replication. Such 
cis sequences often are duplicated and/or rearranged. In the case of cenain alphaviruses 
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(e.g., Sindbis virus), the sequence and organization of DI RNA genomes have been 
analyzed and found to contain a minimum of 50 nt from the extreme 3'-end of the wild- 
type virus genome, and at their 5'-ends. either a wild-type sequence or a cellular tRNA 
[e.g., tRNA Aip ) sequence, in addition to the viral sequence. In all cases, the propagation 
5 and maintenance of the mutated DI genomes requires the co-existence of parental helper 
virus in the infected cell. However, as a result of their genetic structure, DI genome 
replication is vastly superior and comparatively abundant to its wild-type counterpart. 
This characteristic results in interference of wild-type genome replication, the absence 
or low level production of infectious virus, and the establishment of long-term 

1 0 persistent infection of cells. 

Therefore, as described below in Examples 1 and 2, the ability to 
establish long-term persistent infection in permissive cells (e.g., mammalian ceils, 
including cells of human origin) by infecting with a mixed alphavirus stock containing a 
population of DI panicles provides a mechanism to isolate, over time, fully intact virus 

15 variants that are able to establish persistent infection, even in the absence of DI 
particles. Such infectious virus variants can be isolated from long-term persistently 
infected cultures by multiple rounds of plaque purification and have been found to 
initiate productive, persistent, and non-cytopathic infection in the host cells. 
Furthermore, the level of variant virus produced from such a productive, persistent, and 

20 non-cytopathic infection is indistinguishable from wild-type virus infection. This 
observation is in distinct contrast to the previous requirement for establishment of 
persistent infections with virus stocks containing a mixture of DI particles. 

b. Selection from Virus Stocks Not Containing DI Particles 
25 In addition to selection from virus stocks that contain defective- 

interfering particles, virus variants suitable for use within the present invention may be 
obtained from purified virus stocks (without DI panicles) which are either subjected to 
random mutagenesis prior to infection of susceptible cultured cells or allowed to 
generate non-specific mutations during RNA replication with the cultured cells. 
30 Briefly, the initial virus stock may be obtained as a natural isolate or biological vanant 
derived therefrom, or may be generated by transfecting cultured cells with an infectious 
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nucleic acid molecule comprising a genomic cDNA clone or in vitro transcribed RNA. 
If desired, the vims stock may then subjected to physical or chemical mutagenesis 
(although preferred, such mutagenesis is not required). In the case of chemical 
mutagenesis, preferred embodiments utilize a readily available mutagenic agent, for 
5 example nitrous acid, 5-azacytidine, N-methyl-N'-mtro-N-nitrosoguanidine, or 
ethylmethane sulfonate (Sigma. St. Louis. MO), prior to virus infection. Following 
random mutagenesis, specific selection procedures are applied to isolate virus variants 
possessing the desired phenotype, as described in more detail below in Example 2. 

10 2. Genetic Selection of Vims Variants 

In a related approach, mutations may be obtained not using a vims stock, 
but rather, using cloned genomic cDNA of the vims that can be used subsequently to 
transcribe infectious viral RNA in vitro (for example, Sindbis vims (Rice et aL J. Virol. 
67:3809-3819, 1987; Dubensky et al., J. Virol 70:508-519. 1996, SFV (Liljestrom et 

15 ai.. J. Virol 65:4107-41 13, 1991, VEE (Davis et al., Virology 755:20-31, 1991), Ross 
River vims (Kuhn et al.. Virology 752:430-441. 1991), poliovirus (Van Der Werf et al., 
Proc. Natl Acad. Sci. USA 5J:2330-2334, 1986)) or in vivo (Sindbis vims (Dubensky et 
al., ibid.), poliovirus (Racaniello and Baltimore. Science 274:916-919. 1981)). Briefly, 
the infectious nucleic acid is introduced into susceptible cultured cells {e.g., mammalian 

20 cells, including cells of human origin) either directly or following mutagenesis 
performed using one of the above-referenced methods. Alternatively, vector nucleic 
acid may be packaged into particles initially, and the particles used to deliver vector 
into the target cell population for selection. Subsequently, specific selection procedures 
to isolate vims variants possessing the desired phenotype are applied, and are described 

25 below. 

In certain embodiments, random mutagenesis may be performed initially 
by propagation of the piasmid containing viral cDNA in the XLl-Red strain of E. coli 
(Stratagene, San Diego, CA), which is deficient in three of the primary DNA repair 
pathways, resulting from mutS, mutD, and mutT mutations. However, other 
30 mutagenesis procedures including, but not limited to. linker- scanning mutagenesis 
(Haltiner et ah. Nucleic Acids Res. 75:1015, 1985; Barany, Proc. Natl. Acad. Sci. USA 
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£2:4202. 1985), random oligonucleotide-directed mutagenesis (Kunkel et al.. Methods 
Enzymoi 155:166. 1987; Zoller and Smith, Methods Enzymol. 754:329, 1987; Hill 
etal., Methods Enzymol 755:558, 1987; Hermes et al M Gene 54:143, 1989) and PCR 
mutagenesis (Herlitze and Koenen, Gene 91:143. 1 990), can be readily substituted 
5 utilizing published protocols. The resulting mixed population of mutated cDNA clones 
is introduced into susceptible cultured cells directly, or after transcription in vitro. 
Enrichment for transfected cells which contain mutated virus of the desired phenotype 
is accomplished based on increased survival time over wild-type virus infected cells, as 
described below, 

10 

3. Genetic Selection of Variants Using Virus-Derived Vectors 

In another approach, mutations may be generated in any region a virus- 
derived expression vector, including the regulator)', untranslated regions, or protein- 
encoding gene regions. For example, within one aspect of the invention methods are 

15 provided for selecting viral variants with reduced or no inhibition of host-cell directed 
macromolecular synthesis, comprising the steps of: (a) introducing into a cell a 
eukaryotic layered vector initiation system, RNA vector replicon. or recombinant 
alphavirus panicle which directs the expression of a immunogenic cell surface protein 
(suitable for detection of vector containing cells), or alternatively, a selectable marker 

20 (either a drug or non-drug marker wherein non-vector containing cells are killed upon 
addition of. for example, a drug such as neomycin, hygrornycin, phleomycin, gpt, 
puromycin. or histidinol); (b) incubation or culturing the cells under conditions and for 
a time sufficient to select vector containing ceils which exhibit the desired phenotype; 
followed by (c) isolating cells which contain the vector of the desired phenotype and (d) 

25 analysis of the vector for the causal mutation. 

As noted above, the viral vectors of the present invention may be derived 
from a wide variety of viruses (e.g., Sindbis virus (Xiong et al.. Science 243:1188-1191. 
1989; Hahn et al., Proc. Natl. Acad. Sci. USA 59:2679-2683, 1992; Schlesinger, Trends 
Biotechnol. 77:18-22, 1993; Dubensky et al., ibid), Semliki Forest virus (Liljestrom 

30 and Garoff. Bio/Technology 9:1356-1361, 1991), Venezuelan equine encephalitis virus 
(Davis et al., J. Cell. Biochem. Suppl. 79/4:310, 1995), poliovirus (Choi et ah, J. Virol. 
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(55:2875-2883. 1991; Ansardi et aL Cancer Res. 54:6359-6364, 1994; and Andino et 
al.. Science 265: 1448-1 451, 1994). Representative examples of the above-described 
methods are discussed in more detail below within Example 2. 

5 4. Use of Viral Variants 

As discussed in more detail herein, viral variants which have been 
selected or generated utilizing the methods provided herein may be utilized to construct 
a wide variety of recombinant gene delivery vehicles which exhibit the desired 
phenotype. Within certain embodiments, the gene delivery vehicle contains a mutation 

0 within the Leu-Xaa-Pro-Gly-Gly ("LXPGG") motif of the nsP2 gene. Briefly, for 
alphaviruses wherein published sequence of the nsP2 gene is available, a highly 
conserved amino acid motif -Leu-Xaa-Pro-Gly-Gly- ("LXPGG") is observed. As 
predicted by standard protein modelling algorithms (Chou and Fasman, Adv. Enzym. 
47:45- 148, 1978), the residues of this motif possibly comprise a [3 turn in the structure. 

5 Proline 726 of nsP2 in Sindbis virus is the central residue of this motif. The 
corresponding motif in other alphaviruses is illustrated in the table below. 



Alphavir us Strain* 

1 . Sindbis virus 

2. S.A.ARS6 virus 

3. Ockelbo virus 

4. Aura virus 

5. Semliki Forest virus 

6. VEE virus 

7. Ross River virus 



Pro-Gly-Gly Region 
Leu-Asn-Pro-Gly-Gly-Thr 
Leu-Asn-Pro-Gly-Gly-Thr 
Leu-Asn-Pro-Gly-Gly-Thr 
Leu-Lys-Pro-Gly-Gly-Thr 
Leu-Lys-Pro-GIy-Gly-Ile 
Leu-Asn-Pro-Gly-Gly-Thr 
Leu-Xaa-Pro-Gly-Gly-Ser 



nsP2 a.a/sfP-G-G) 
a.a. = 726-728 
a.a. - 726-728 
a.a. = 726-728 
a.a. = 725-727 
a.a. = 718-720 
a.a. = 713-715 
a.a. = 717-719 



*Alphavirus strains with published nsP2 sequences: (I) Strauss et al.. Virology 133:92- 
0 110, 1984; (2) Simpson et al., Virology 222:464-469 1996; (3) Shirako et al., Virology 
182:153-164, 1991; (4) Rumenapf et al., Virology 205:621-633, 1995; (5) Takkinen, 
Nucleic Acids Res. 74:5667-5682, 1986; (6) Kinney et al., Virology 170:19-30, 1989; 
and (7) Faragher et al, Virology 163:509-526, 1988. 

5 Hence, within various embodiments of the present invention, gene 

delivery vehicles are provided wherein the gene delivery vehicle contains an nsP2 gene 
with a mutation in the LXPGG motiff. Within one embodiment, the Leu codon is 
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mutated to another amino acid selected from the group consisting of Ala. Arg, Asn. 
Asp, Asx, Cys, Gin. Glu. Glx, Gly, His, He, Lys, Met, Phe, Pro, Ser. Thr, Trp, TyT, Val. 
or another rare or non-protein amino acid (see. e.g., Lehninger, Biochemistry, Worth 
Publishers. Inc.. X.Y. N.Y., 1975). Within another embodiment, the Pro codon is 
5 mutated to another amino acid selected from the group consisting of Ala. Arg, Asn, 
Asp, Asx, Cys, Gin, Glu, Glx. Gly, His, He, Leu, Lys, Met, Phe, Ser, Thr, Trp, Tyr, Val, 
or another rare or non-protein amino acid. Within other embodiments, either or both of 
the Gly codons may be mutated to another amino acid selected from the group 
consisting of Ala. Arg, Asn. Asp, Asx, Cys, Gin. Glu, Glx, His, lie. Leu, Lys, Met, Phe, 

1 0 Pro, Ser, Thr. Trp ; Tyr. Val. or another rare or non-protein amino acid. Within yet other 
embodiments, the Xaa amino acid, or amino acids between 1 and 3 residues upstream or 
downstream of the LXPGG motiff may be mutated from the wild-type amino acid in 
order to effect the phenotype of the resultant gene delivery vehicle. Within certain 
embodiments of the invention, the LXPGG motiff may be mutated to contain more than 

1 5 one codon alteration, or alternatively, one or more codon insertions or deletions. 

C. Alphavirus Vector Constructs and Alphavirus R NA Vector Replicons 

As noted above, the present invention provides both DNA and RNA 
constructs which are derived from alphaviruses. Briefly, within one aspect of the 

20 present invention alphavirus vector constructs are provided, comprising a 5' promoter 
which initiates synthesis .of viral RNA in vitro from cDNA, a 5' sequence which 
initiates transcription of alphavirus RNA, a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins including an isolated nucleic acid 
molecule as described above, an alphavirus RNA polymerase recognition sequence and 

25 a 3* polyadenylate tract. Within other aspects, RNA vector replicons are provided, 
comprising a 5' sequence which initiates transcription of alphavirus RNA, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, including 
the isolated nucleic acid molecules discussed above, an alphavirus RNA polymerase 
recognition sequence and a 3' polyadenylate tract. Within preferred embodiments of the 

30 above, the above constructs further comprise a viral junction region. Each of these 
aspects are discussed in more detail below. 
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1. .V Promoters which initiate synthesis of virai RNA 

As noted above, within certain embodiments of the invention, alphavirus 
vector constructs are provided which contain 5' promoters which (e.g., DNA dependent 
5 RNA polymerase promoters) initiate synthesis of viral RNA from cDNA by a process 
of in vitro transcription. Within preferred embodiments such promoters include, for 
example, the bacteriophage T7, T3, and SP6 RNA polymerase promoters. Similarly, 
eukarvioic layered vector initiation systems are provided (e.g., DNA dependent RNA 
polymerase promoters) which contain 5' promoters which initiate synthesis of viral 

10 RNA from cDNA in vivo (i.e., within a cell). Within certain embodiments, such RNA 
polvmerase promoters {for either alphavirus vector constructs or eukaryotic layered 
vector initiation systems) may be derived from both prokaryotic and eukaryotic 
organisms, and include, for example, the bacterial p-galactosidase and trpE promoters, 
and the eukaryotic viral simian virus 40 (SV40) (e.g., early or late), cytomegalovirus 

15 (CMV) (e.g., immediate early), Moloney murine leukemia virus (MoMLV) or Rous 
sarcoma virus (RSV) LTR, and herpes simplex virus (HSV) (thymidine kinase) 
promoters. 

2. Sequences Which Initiate Transcription 

20 As noted above, within preferred embodiments the alphavirus vector 

constructs and RNA vector replicons of the present invention contain a 5' sequence 
which is capable of initiating transcription of an alphavirus RNA (also referred to as 5'- 
end CSE, or 5' cis replication sequence). Representative examples of such sequences 
include nucleotides 1-60, and to a lesser extent nucleotides through bases 150-210, of 

25 the wild-type Sindbis virus, nucleotides 10-75 for tRNA Asp (aspartic acid, Schlesinger 
etaL U.S. Patent No. 5,091,309), and 5' sequences from other alphaviruses which 
initiate transcription. It is the complement of these sequences, which corresponds to the 
3' end of the of the minus-strand genomic copy, "which are bound by the nsP replicase 
complex, and possibly additional host cell factors, from which transcription of the 

30 positive-strand genomic RNA is initiated. 
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3. Alphavirus Nonstructural Proteins 

The alphavirus vector constructs and RNA vector replicons provided 
herein also require sequences encoding all four alphaviral nonstructural proteins, 
5 including a sequence which provides the desired phenotype discussed above. Briefly, a 
wide varietv of sequences which encode alphavirus nonstructural proteins, in addition to 
those explicitly provided herein, may be utilized in the present invention, and are 
therefore deemed to fall within the scope of the phrase "alphavirus nonstructural 
proteins." For example, due to the degeneracy of the genetic code, more than one codon 

10 mav code for a given amino acid. Therefore, a wide variety of nucleic acid sequences 
which encode alphavirus nonstructural proteins may be generated. Furthermore, amino 
acid substitutions, additions, or deletions at any of numerous positions may still provide 
functional or biologically active nonstructural proteins. Within the context of the 
present invention, alphavirus nonstructural proteins are deemed to be biologically active 

15 if they promote self-replication of the vector construct, i.e., replication of viral nucleic 
acids and not necessarily the production of infectious virus, and may be readily 
determined by metabolic labeling or RNase protection assays performed over a time 
course. Methods for making such derivatives are readily accomplished by one of 
ordinary skill in the art given the disclosure provided herein. 

20 Alphaviruses express four nonstructural proteins, designated nspl, nsp2, 

nsp3, and nsp4. Vectors of the present invention derived from alphaviruses should 
contain sequences encoding the four nonstructural proteins. In wild-type Sindbis virus, 
nonstructural proteins 1-3 are encoded by nucleotides 60 to 5747, while nsP4 is encoded 
by nucleotides 5769 to 7598 (see Figure 1). The nonstructural proteins are translated 

25 from the genomic positive strand RNA as one of two large polyproteins, known as PI 23 
or PI 234, respectively, depending upon (i) whether there is an opal termination codon 
between the coding regions of nsP3 and nsP4 and (ii) if there is such an opal codon 
present, whether there is translation termination of the nascent polypeptide at that point 
or readthrough and hence production of PI 234. The opal termination codon is present 

30 at the nsP3/nsP4 junction of the alphaviruses SIN (strain AR339 and the SIN-1 strain 
described herein), AURA, WEE, EEE, VEE, and RR, and thus the P123 and P1234 
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species are expressed in cells infected with these viruses. In contrast, no termination 
codon is present at the nsP3/nsP4 junction of the alphaviruses SIN (strain ARS6, SF, 
and ONN). and thus only the PI 234 species is expressed in cells infected with these 
viruses. Both the polyprotein and processed monomelic forms of the nonstructural 

5 proteins function in the replication of the alphavirus RNA genome. Experiments 
examining growth characteristics of alphavirus nonstructural protein cleavage mutants 
have indicated that the polyproteins are involved in the synthesis of the genomic 
negative stranded RNA, while the individual monomeric proteins catalyze the synthesis 
of the genomic and subgenomic positive stranded RNA species (Shirako and Strauss, J. 

0 Virol. 68: 1874- 1885. 1994). Translational readthrough generally occurs about 10%- 
20% of the time in cells infected with wild type Sindbis virus containing the opal 
termination codon at the nsP3/nsP4 junction. Processing of P123 and P1234 is by a 
proteinase activity encoded by the one of the nonstructural proteins, and is discussed 
further below. The order of processing, whether in cis or in trans, depends on various 

5 factors, including the stage of infection. For example, Sindbis virus and SFV produce 
PI 23 and nsp4 early in infection, and PI 2 and P34 later in infection. Further processing 
then releases the individual nonstructural proteins. Each nonstructural protein has 
several functions, some of which are described below. 

0 a. nsPl 

Nonstructural protein 1 is required for the initiation of (or continuation 
of) minus-strand RNA synthesis. It also plays a role in capping the 5' terminus of 
genomic and subgenomic alphavirus RNAs during transcription, as nsPl possesses both 
methy {transferase (Mi and Stollar. Vir. / £4:423-427, 1991) and guanyl transferase 

5 activity (Strauss and Strauss, Microbiol. Rev, J5(3):49 1-562, 1994). NsPl also 
modulates the proteinase activity of nsP2, as polyproteins containing nsPl inefficiently 
cleave between nsP2 and nsP3 (de Groot et al., EMBOJ. P:263 1-2638, 1990). 

b. nsE2 

0 Nonstructural protein 2 is a multifunctional protein, involved in the 

replication of the viral RNA and processing of the nonstructural polyprotein. The N- 
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terminal domain of the protein (spanning about the first 460 ammo acids) is believed to 
be a helicase which is active in duplex unwinding during RNA replication and 
transcription- Synthesis of 26S subgenomic mRNA, which, in vectors according to the 
present invention, encodes the gene(s) of interest, requires functional nsP2. The C- 
5 terminal domain of nsP2. between amino acid residues 460-807 of Sindbis virus, 
proteolvtically cleaves in trans and in cis the nonstructural polyprotein between the 
nsPl/nsP2, nsP2/nsP3. and nsP3/nsP4 junctions. Alignment of the primary sequences 
of the alphavirus nsP2 C-terminal domains suggests that nsP2 is a papain-Iike 
proteinase (Hardy and Strauss. J. Virol. tfj:4653-4664, 1988). 

10 Other observed characteristics of nsP2 have not. as yet, been assigned a 

function directly related to the propagation of alphaviruses. For example, it has been 
shown that nsP2 is closely associated with ribosomes in SFV-infected cells, and can be 
cross-linked to rRNA by UV irradiation (Ranki et al., FEBS Leu. 108:299-302, 1979). 
Further, 50% of nsP2 is localized in the nuclear matrix, particularly in the area of the 

15 nucleoli of SFV-infected BHK cells (Peranen et al., J. Virol. 54:1888-1896, 1990). 
Localization of nsP2 to the nuclei presumably proceeds by active transport, as it 
exceeds the size of small proteins and metabolites (about 20-60 kD), which can enter 
the nucleus by diffusion through nuclear core complexes (Paine et al., Nature 254:109- 
1 14, 1975). Putative NLS sequences have been identified in the alphaviruses SFV : SIN, 

20 RR. ONN. OCfC and VEE (Rikkonen et al., Vir. y £9:462-473, 1992). 

c. n sP3 

Nonstructural protein nsP3 contains two distinct domains, although their 
precise roles in viral replication are not well understood. The N-terminal domain ranges 

25 in length from 322 to 329 residues in different alphaviruses and exhibits a minimum of 
51% amino acid sequence identity among any two alphaviruses. The C-terminal 
domain, however, is not conserved among known alphaviruses in length or in sequence, 
and multiple changes are tolerated (Li et al.. Virology, 7 79:416-427). The protein is 
found associated with replication complexes in a heavily phosphorylated state. In 

30 alphaviruses whose genomes contain an opal termination codon between the nsP3/nsP4 
junction, two different proteins are produced depending upon whether or not there is 
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readthrough of the opal termination signal. Readthrough results in an nsP3 protein 
which contains 7 additional carboxy terminal amino acids after cleavage of the 
polyprotein. It is clear that nsP3 is required in some capacity for viral RNA synthesis, 
as particular mutants of this protein are RNA negative, and the PI 23 polyprotein is 
5 required for minus-strand RNA synthesis. 

d. nsP4 

NsP4 is the virus-encoded RNA polymerase and contains the GDD motif 
characteristic of such enzymes (Kamer and Argos, Nucleic Acids Res. 1 2:7269-72S2, 

0 1984). Thus. nsP4 is indispensable for alphavirus RNA replication. The concentration 
of nsP4 is tightly regulated in infected cells. In most alphaviruses. translation of nsP4 
requires readthrough of an opal codon between the nsP3 and nsP4 coding regions, 
resulting in lower intracellular levels as compared to other nonstructural proteins. 
Additionally, the bulk of nsP4 is metabolically unstable, through degradation by the N- 

5 end rule pathway (Gonda et aL J. Biol. Chem. 264:16700-16712. 1989). However, 
some nsP4 is stable, due to its association with replication complexes which conceal 
degradation signals. Thus, stabilization of the enzyme by altering the amino terminal 
residue may prove useful in promoting more long term expression of proteins encoded 
by the vectors described herein. Stabilizing amino terminal residues include 

0 methionine, alanine, and tyrosine. 

4. Viral junction Regions 

The alphavirus viral junction region normally controls transcription 
initiation of the subgenomic mRNA; thus, this element is also referred to as the 

5 subgenomic mRNA promoter. In the case of Sindbis virus, the normal viral junction 
region typically begins at approximately nucleotide number 7579 and continues through 
at least nucleotide number 7612 {and possibly beyond). At a minimum, nucleotides 
7579 to 7602 (5*- ATC TCT ACG GTG GTC CTA AAT AGT - SEQ. ID NO. 1) are 
believed necessary for transcription of the subgenomic fragment. This region 

0 (nucleotides 7579 to 7602) is hereinafter referred to as the "minimal junction region 
core." 
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Within certain aspects of the invention, the viral junction region is 
inactivated in order to prevent synthesis of the subgenomic fragment. As utilized 
within the context of the present invention, "inactivated" means that the species 
corresponding to subgenomic mRNA is not observed in autoradiograms from 
5 denaturing gels of electrophoresed RNA purified from cells containing these vectors 
and treated with 1 ug/ml dactinomycin and labeled with [ 3 H]-uridine, as described 
(Frolovand Schlesinger, J. Virol. (55:1721-1727, 1994). 

Within one embodiment of the invention, gene delivery vehicles may be 
constructed by the placement of signals promoting either ribosome readthrough or 
10 internal nbosome entry immediately downstream of the disabled junction region 
promoter. In this vector configuration, synthesis of subgenomic message cannot occur; 
however, the heterologous proteins are expressed from genomic length mRNA by either 
ribosomal readthrough (scanning) or internal ribosome entry. 

In certain applications of the gene delivery vehicles described herein, the 
15 expression of more than one heterologous gene is desired. For example, in order to 
treat metabolic disorders such as Gaucher's syndrome, multiple administrations of gene 
delivery vehicles or panicles may be required, since duration of the therapeutic 
palliative may be limited. Therefore, within certain embodiments of the invention it 
may be desirable to co-express in a target cell the Adenovirus 2 E3 gene, along with a 
20 therapeutic palliative, such as the glucocerebrosidase gene. In wild-type virus, the 
structural protein (sP) polycistronic message is translated into a single polyprotein 
which is processed subsequently into individual proteins in part by the sP capsid 
proteinase. Thus, expression of multiple heterologous genes from a polycistronic 
message requires a mechanism different from the wild-type virus, since the protease 
25 activity of the capsid sP, or the peptides recognized for cleavage, are not present in the 
replacement region of the alphavirus vectors. Therefore, within further embodiments of 
the invention, functional elements which permit translation of multiple independent 
heterologous sequences, including ribosomal readthrough, cap-independent translation, 
internal ribosome entry, or minimal junction region core sequences, can be utilized. 

30 
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5. Aiphavirus RNA polymerase recogn ition sequence, and pol yf A) tract 

As noted above, aiphavirus vector constructs or RNA vector replicons of 
the present invention also should include an aiphavirus RNA polymerase recognition 
sequence (also termed "aiphavirus replicase recognition sequence". "3' terminal CSE", 
5 or "3' cis replication sequence"). Briefly, the aiphavirus RNA polymerase recognition 
sequence, which is located at the 3' end region of positive stranded genomic RNA, 
provides a recognition site at which the virus begins replication by synthesis of the 
negative strand. A wide variety of sequences may be utilized as an aiphavirus RNA 
polymerase recognition sequence. For example, within one embodiment. Sindbis virus 

10 vector constructs in which the polymerase recognition is truncated to the smallest region 
that can still function as a recognition sequence {e.g., nucleotides 1 i .684 to 1 1,703) can 
be utilized. Within another embodiment of the invention, Sindbis virus vector 
constructs in which the entire nontranslated region downstream from the El sP gene to 
the 3' end of the viral genome including the polymerase recognition site {e.g., 

15 nucleotides 1 1,382 to 1 1,703), can be utilized. 

Within preferred embodiments of the invention, the aiphavirus vector 
construct or RNA vector replicon may additionally contain a poly(A) tract, which 
increases dramatically the observed level of heterologous gene expression in cells 
transfected with alphavirus-derived vectors {see e.g., Dubensky et al, supra). Briefly, 

20 the poly(A) tract may be of any size which is sufficient to promote stability in the 
cytoplasm, thereby increasing the efficiency of initiating the viral life cycle. Within 
various embodiments of the invention, the poly(A) sequence comprises at least 10 
adenosine nucleotides, and most preferably, at least 25 adenosine nucleotides. Within 
one embodiment, the poly(A) sequence is attached directly to Sindbis virus nucleotide 

25 11,703. 

D. Eukaryotic Layered Vector Initiation Systems 

Due to the size of a full-length genomic aiphavirus cDNA clone, in vitro 
transcription of full-length, capped RNA molecules is rather inefficient. This results in 
30 a lowered transfection efficiency, in terms of infectious centers of virus (as measured by 
plaque formation), relative to the amount of in vitro transcribed RNA transfected. Such 
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inefficiency is aiso relevant to the in vitro transcription of alphavirus expression 
vectors. Testing of candidate cDNA clones and other alphavirus cDNA expression 
vectors for their ability to initiate an infectious cycle or to direct the expression of a 
heterologous sequence can thus be greatly facilitated if a cDNA clone is transfected into 
5 susceptible cells as a DNA molecule, which then directs the synthesis of viral RNA 
in vivo. 

Therefore, within one aspect of the present invention DNA-based vectors 
(referred to as "Eukaryotic Layered Vector Initiation Systems 1 ') are provided that are 
capable of directing the synthesis of viral RNA (genomic or vector) in vivo. Generally, 

10 eukaryotic layered vector initiation systems comprise a 5' promoter that is capable of 
initiating in vivo (i.e.. within a cell) the 5' synthesis of RNA from cDNA. a construct 
that is capable of directing its own replication in a cell, the construct also being capable 
of expressing a heterologous nucleic acid sequence, and a 3' sequence that controls 
transcription termination (e.g., a polyadenylale tract). Such eukaryotic layered vector 

15 initiation systems provide a two-stage or "layered" mechanism that controls expression 
of heterologous nucleotide sequences. Briefly, the first layer initiates transcription of 
the second layer and comprises a promoter that is capable of initiating in vivo the 5' to 3' 
synthesis of RNA from cDNA {e.g., a 5' eukaryotic promoter), and may further 
comprise other elements, including a 3' transcription terrninatiorvpolyadenyiation site. 

20 one or more splice sites, as well as other RNA nuclear export elements, including, for 
example, the hepatitis B virus posttranscriptional regulatory element (PRE) (Huang et 
ah, Mol Cell. Biol. 13:7416. 1993; Huang et aL J. Virol. (55:3193, 1994; Huang et aL 
Mol. Cell Biol. 75:3864-3869, 1995), the Mason-Pfizer monkey virus constitutive 
transport element (CTE) (Bray et al., Proc. Natl Acad. Sci. USA 97:1256-1260, 1994), 

25 the HIV Rev responsive element (Malim et aL, Nature 555:254-257, 1989; Cullen et al., 
Trends Biochem. Sci. 16:346, 1991), and other similar elements, if desired. 
Representative promoters suitable for use within the present invention include both 
eukaryotic (e.g., pol I, II, or III) and prokaryotic promoters, and inducible or non- 
inducible (i.e., constitutive) promoters, such as, for example, Moloney murine leukemia 

30 virus promoters, metallothionein promoters, the glucocorticoid promoter, Drosophila 
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protein 70 promoter, immunoglobulin promoters, mouse polyoma virus promoter (Py), 
Rous sarcoma Virus (RSV), herpes simplex virus (HSV) promoter, BK virus and JC 
vims promoters, mouse mammary tumor virus (MMTV) promoter, alphavirus junction 
5 region. CMV promoter. Adenovirus El or VA1RNA promoters. rRNA promoters, 
tRNA methionine promoter. CaMV 35 S promoter, nopaline synthetase promoter, 
tetracycline responsive promoter, and the lac promoter. 

Within yet other embodiments of the invention, inducible promoters may 
be utilized. For example, within one embodiment inducible promoters are provided 

10 which initiate the synthesis of RNA from DNA. comprising a core RNA polymerase 
promoter sequence, and an operably linked nucieic acid sequence that directs the DNA 
binding of a transcriptional activator protein, and an operably linked nucleic acid 
sequence that directs the DNA binding of a transcriptional repressor protein. Within a 
further embodiment, the nucleic acid sequence that directs the DNA binding of a 

15 transcriptional activator protein is a sequence that binds a tetracycline repressor/VP 1 6 
transactivator fusion protein. Within yet another embodiment, the nucleic acid 
sequence that directs the DNA binding of a transcription repressor protein is a sequence 
that binds a lactose repressor / Kruppel domain fusion protein. 

The second layer comprises an autocatalytic vector construct which is 

20 capable of expressing one or more heterologous nucleotide sequences and of directing 
its own replication in a cell, either autonomously or in response to one or more factors 
(e.g. is inducible). The second layer may be of viral or non-viral origin. Within one 
embodiment of the invention, the second layer construct may be an alphavirus vector 
construct as described above. 

25 A wide variety of vector systems may be utilized as the first layer of the 

eukaryotic layered vector initiation system, including for example, viral vector 
constructs developed from DNA viruses such as those classified in the Poxviridae, 
including for example canary pox virus or vaccinia virus {e.g., Fisher-Hoch et al.. PNAS 
5(5:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. .550:86-103, 1989; Flexner et al., 

30 Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330 and 5,017,487; WO 
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S9/01973); Papoviridae such as BKV, JCV or SV40 (e.g., Mulligan et aL Nature 
277:108-114, 1979); Adenoviridae. such as adenovirus (e.g.. Berkner. Biotechniques 
(5:616-627. 198S; Rosenfeld et aL Science 252:431-434, 1991); Parvoviridae, such as 
adeno-associated virus (e.g.. Samulski et aL J- Vir. o~J:3822-382S. 1989; Mendelson 
5 et aL Virol 766:154-165. 1988: PA 7/222.684); Herpesviridae, such as Herpes Simplex 
Virus (e.g.. Kit. Adv. Exp. Med. Biol 2/5:219-236, 1989); and Hepadnaviridae (e.g., 
HBV), as well as certain RNA viruses which replicate through a DNA intermediate, 
such as the Retrovindae {see. e.g.. U.S. Patent No. 4 7 777,127, GB 2,200.65 L EP 
0.345,242 and W09 1/02805: Retroviridae include leukemia in viruses such as MoMLV 
10 and immunodeficiency viruses such as HIV. e.g., Poznansky, J. Virol. (55:532-536, 
1991). 

Similarly, a wide variety of vector systems may be utilized as second 
laver of the eukaryotic layered vector initiation system, including for example, vector 
svstems derived from viruses of the following families: Picomaviridae (e.g., poliovirus, 

15 rhinovirus. coxsackieviruses), Caliciviridae, Togaviridae (e.g., alphavirus, rubella), 
Flaviviridae (e.g., yellow fever. HCV), Coronaviridae (e.g., HCV, TGEV, LBV, MHV, 
BCV), Bunyaviridae. Arenaviridae. Retroviridae (e.g., RSV, MoMLV, HIV, HTLV), 
hepatitis delta virus and Astrovirus. In addition, non-mammalian RNA viruses (as well 
as components derived therefrom) may also be utilized, including for example, bacterial 

20 and bacteriophage replicases. as well as components derived from plant viruses, such as 
potexviruses (e.g., PVX), carlaviruses (e.g., PVM), tobraviruses {e.g., TRV, PEBV, 
PRV), Tobamoviruses (e.g., TMV, ToMV, PPMV), luteoviruses (e.g., PLRV), 
potyviruses (e.g., TEV, PPV, PVY), tombusviruses (e.g., CyRSV), nepoviruses (e.g., 
GFLV), bromoviruses (e.g., BMV), and topamo viruses. 

25 The replication competency of the autocatalytic vector construct, 

contained within the second layer of the eukaryotic vector initiation system, may be 
measured by a variety of assays known to one of skill in the art including, for example, 
ribonuclease protection assays which measure increases of both positive-sense and 
negative-sense RNA in transfected cells over time, in the presence of an inhibitor of 

30 cellular RNA synthesis, such as dactinomycin. and also assays which measure the 
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synthesis of a subgenomic RNA or expression of a heterologous reporter gene in 
transfected cells. 

Within particularly preferred embodiments of the invention, eukaryotic 
layered vector initiation systems are provided that comprise a 5' promoter which is 
5 capable of initiating in vivo the synthesis of alphavirus RNA from cDNA (i.e., a DNA 
promoter of RNA synthesis), followed by a 5' sequence which is capable of initiating 
transcription of an alphavirus RNA. a nucleic acid sequence which operably encodes all 
four alphaviral nonstructural proteins (including a nucleic acid molecule as described 
above which, when operably incorporated into a recombinant alphavirus particle, results 

10 in the desired phenotype), an alphavirus RNA polymerase recognition sequence, and a 
3' sequence which controls transcription termination/polyadenylation. In addition, a 
viral junction region which is operably linked to a heterologous sequence' to be 
expressed may be included. Within various embodiments, the viral junction region may 
be modified, such that viral transcription of the subgenomic fragment is increased, 

1 5 reduced, or inactivated. Within other embodiments, a second viral junction region may 
be inserted following the first inactivated viral junction region, the second viral junction 
region being either active or modified such that viral transcription of the subgenomic 
fragment is increased or reduced. 

Following in vivo transcription of the eukaryotic layered vector initiation 

20 system, the resulting alphavirus RNA vector replicon molecule is comprised of a 5' 
sequence which is capable of initiating transcription of an alphavirus RNA, a nucleotide 
sequence encoding biologically active alphavirus nonstructural proteins, a viral junction 
region, a heterologous nucleotide sequence, an alphavirus RNA polymerase recognition 
sequence, and a polyadenylate sequence. 

25 Various aspects of the alphavirus cDNA vector constructs have been 

discussed above, including the 5' sequence which is capable of initiating transcription of 
an alphavirus, the nucleotide sequence encoding alphavirus nonstructural proteins, the 
viral junction region, including junction regions which have been inactivated such that 
viral transcription of the subgenomic fragment is prevented, and the alphavirus RNA 
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polymerase recognition sequence. In addition, modified junction regions and tandem 
junction regions have also been discussed above. 

In another embodiment of the invention, the eukaryotic layered vector 
initiation system is derived from an alphavirus vector, such as a Sindbis vector 
5 construct, which has been adapted to replicate in one or more cell lines from a particular 
eukaryotic species, especially a mammalian species, such as humans. For instance, if 
the gene encoding the recombinant protein to be expressed is of human origin and the 
protein is intended for human therapeutic use. production in a suitable human cell line 
may be preferred in order that the protein be post-translationally modified as would be 

10 expected to occur in humans. This approach may be useful in further enhancing 
recombinant protein production (as discussed in more detail below). Given the overall 
plasticity of an alphaviral genome due to the infidelity of the viral replicase, variant 
strains with an enhanced ability to establish high titer productive infection in selected 
eukaryotic cells (e.g., human, murine, canine, feline, etc.) can be isolated. Additionally, 

15 variant alphaviral strains having an enhanced ability to establish high titer persistent 
infection in eukaryotic cells may also be isolated using this approach. Alphavirus 
expression vectors can then be constructed from cDNA clones of these variant strains 
according to procedures provided herein. 

Within another embodiment of the invention, the eukaryotic layered 

20 vector initiation system comprises a promoter for initial alphaviral vector transcription 
that is transcriptionally active only in a differentiated cell type. Briefly, it is well 
established that alphaviral infection of mammalian cells in culture, such as those 
derived from hamster {e.g., baby hamster kidney cells) or chicken (e.g., chicken embryo 
fibroblasts), typically results in cytoxicity. Thus, to produce a stably transformed or 

25 transfected host cell line, the eukaryotic layered vector initiation system may be 
introduced into a host cell wherein the promoter which enables the initial vector 
amplification is a transcriptionally inactive, but inducible, promoter, in a particularly 
preferred embodiment, such a promoter is differentiation state dependent. In this 
configuration, activation of the promoter and subsequent activation of the alphavirus 

30 DNA vector coincides with induction of cell differentiation. Upon growth to a certain 
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cell number of such a stably transformed or transfected host cell line, the appropriate 
differentiation stimulus is provided, thereby initiating transcription of the vector 
construct and amplified expression of the desired gene and encoded polypeptide(s). 
Many such differentiation state-dependent promoters are known to those in the an, as 
5 are cell lines which can be induced to differentiate by application of a specific stimulus. 
Representative examples include cell lines F9 and PI 9, HL60, and Freund 
erythroleukemic cell lines and HEL. which are activated by retinoic acid, horse serum, 
and DMSO. respectively. 

In a preferred embodiment, such promoters can be regulated by two 

10 separate components. For example, as described in Example 7, binding sites for both a 
transcriptional activator and a transcriptional repressor are positioned adjacent to a 
"core" promoter, in an operably-dependent manner. In this configuration, the 
uninduced state is maintained by blocking the ability of the transcriptional activator to 
find its recognition site, while allowing the transcriptional repressor to be constitutively 

15 expressed and bound to its recognition site. Induction is permitted by blocking the 
transcriptional repressor and removing the transactivator block. For example, a 
tetracycline-responsive promoter system (Gossen and Bujard, Proc. Natl Acad. Sci. 
59:5547-5551, 1992) may be utilized for inducible transcription of an aiphavirus vector 
RNA. In this system, the expression of a tetracycline repressor and HSV-VP16 

20 transactivator domain, as a "fusion" protein (rTA), stimulates in vivo transcription of the 
aiphavirus vector RNA by binding specifically to a tetracycline operator sequence 
(tetO) located immediately adjacent to a minimal "core" promoter (for example, CMV). 
The binding and transactivatton event is reversibly blocked by the presence of 
tetracycline, and may be "turned on" by removing tetracycline from the culture media. 

25 As uninduced basal levels of transcription will vary among different cell types, other 
different minimal core promoters (for example HSV-tk) may be linked to the 
tetracycline operator sequences, provided the transcription start site is known, to allow 
juxtaposition at or in the immediate proximity of aiphavirus vector nucleotide 1. 

The rTA transactivator can be provided by an additional expression 

30 cassette also stably transformed into the same cell line; and in certain embodiments, the 
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rTA expression cassette may itself be autoregulatory. The use of an autoregulatory rTA 
expression cassette circumvents potential toxicity problems associated with constitutive 
high level expression of rTA by linking expression to transcriptional control by the 
same tetO-linked promoter to which rTA itself binds. This type of system creates a 
5 negative feedback cycle that ensures very little rTA is produced in the presence of 
tetracycline, but becomes highly active when the tetracycline is removed. Such an 
autoregulatory rTA expression cassette is provided in plasmid pTet-tTAk (Shockett 
et al., Proc. Natl. Acad. Set. USA 92:6522-6526, 1995). 

For transcriptional repression, the JCRA_B repression domain of a certain 

10 zinc finger proteins can also be utilized. Briefly, KJIAB (Kruppel- associated box) 
domains are highly conserved sequences present in the aniino-terminal regions of more 
than one-third of all Kriippel-class Cys : His : zinc finger proteins. The domains contain 
two predicted amphipathic a-helicies and have been shown to function as DNA 
binding-dependent RNA polymerase II transcriptional repressors (for example, Licht 

15 et al., Nature 346: 76-79, 1990). Like other transcription factors, the active repression 
domain and the DNA-binding domain are distinct and separable. Therefore, the 
repression domain can be linked as a fusion protein to any sequence specific DNA 
binding protein for targeting. Thus, the DNA binding protein component can be 
reversibly prevented from binding in a regulatable fashion, thereby turning "off the 

20 transcriptional silencing. For example, the KRAB domain from human Koxl (Thiesen. 
New Biol. 2:363-374, 1990) can be fused to the DNA-binding lactose (lac) repressor 
protein, forming a hybrid transcriptional silencer with reversible, sequence-specific 
binding to a lac operator sequence engineered immediately adjacent to the tet- 
responsive promoter. In this configuration, constitutive expression of the lac 

25 repressor/KRAB domain fusion (rKR) will result in binding to the lac operator sequence 
and the elimination of any "leaky" basal transcription from the uninduced tet-responsive 
promoter. When vector expression is desired and tetracycline is removed from the 
system, IPTG is added to prevent rKR-mediated transcriptional silencing. 

In addition, KRAB domains from other zinc finger proteins, for example, 

30 ZNF133 (Tommerup et al., Hum. Mol Genet. 2:1571-1575, 1993), ZNF91 (Bellefroid 
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et al., EMBOJ. 7^:1363-1374, 1993), ZNF2 (Rosati et al., Nucleic Acids Res. J 9:5661- 
5667, 1991). as well as other transferable repressor domains, for example, Drosophila 
en or eve genes fJaynes and O'FarreiL EMBOJ. 70:1427-1433, 1991; Han and Manley, 
Genes Dev. 7:491-503. 1993), human zinc finger protein YY1 (Shi et al., Cell 67:311- 
5 3S8, 1991), Wilms' tumor suppressor protein WT1 (Madden et al.. Science 253:1550- 
1553, 1991), thyroid hormone receptor (Baniahmad et al., EMBO J. 77:1015-1023, 
1992), retinoic acid receptor (Baniahmad et al.. ibid), Kid- 1 (Witzgall et al., Proc. Natl. 
Acad. Sci. USA 97:4514-4518, 1994), can likewise be readily used in the gene delivery 
vehicles provided herein. Furthermore, the lac repressor/lac operator component of this 
10 svstem mav be substituted by any number of other regulatable systems derived from 
other sources, for example, the tryptophan and maltose operons. or GAL4. 

E. Recombinant Alphavirus Particles, and Generation and Use of 'Empty 1 
Togavirus Particles or Togaviruses Particles containing non-homologous viral 
15 RNA 

Within another aspect of the present invention, the generation of 
recombinant alphavirus panicles containing RNA alphavirus vectors, which are capable 
of infection of eukaryotic target cells, are described. Briefly, such recombinant 
alphavirus particles generally comprise one or more alphavirus structural proteins, a 

20 lipid envelope, and an RNA vector repiicon as described herein. 

Methods for generating recombinant alphavirus vector particles may be 
readily accomplished by, for example, co-transfection of complementing vector and 
defective helper (DH) molecules derived from in vitro transcribed RNA, or, 
alternatively, plasmid DNA, or by coinfection with virus (see Xiong et al.. Science 

25 243:\ 188-1191, 1989, Bredenbeek et a!., J Virol. 67:6439-6446, 1993, Dubensky et al., 
J. Virol 70:508-519. 1996 and Dubensky et al.. W/O 95/07994). 

Within other aspects, methods for generating recombinant alphavirus 
vector particles from alphavirus-derived packaging or producer cell lines are provided. 
Briefly, such PCL and their stably transformed structural protein expression cassettes 

30 can be derived using methods described within W/O 95/07994, or using novel methods 
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described within this invention. For example, the production of recombinant alphavirus 
vector panicles by PCL can be accomplished following introduction of alphavirus- 
based vector molecules with desirable properties into the PCL (see Example 6), the 
vectors being derived from in vitro transcribed RNA, plasmid DNA, or previously 
5 obtained recombinant alphavirus panicles. In yet a further example, production of 
recombinant panicles from alphavirus vector producer cell lines is descnbed (see 
Example 7). 

Within other embodiments, methods are provided for producing high- 
titer stable togavirus capsid panicles that do not contain any genomic RNA (i.e., contain 

10 substantially no viral RNA) or RNA Vector Replicons. As utilized within the present 
invention, it should be understood that "substantially no" genomic or RNA Vector 
RepHcon nucleic acids refers to ratios of greater than 10:1, and preferably greater than 
15:1 of J5 S methionine versus J H uridine incorporation into virus panicles (as compared 
to wild-type) (see. e.g., Example S and Figure 38). For example, within one 

15 embodiment empty capsid panicles (preferably with the lipid bilayer and lycoprotein 
complement) are constructed from a selected pathogenic virus from the togavirus family 
(such as an Alphavirus or Rubivirus), and used as immunogens to establish protective 
immunity against infection with the wild-type togavirus. The empty viral panicles are a 
desirable immunogenic alternative, as they are unable to replicate and produce virus, yet 

20 are able to generate both cellular and humoral immune responses. Thus, utilizing the 
methods which are described in more detail in Example 8. empty capsid panicles 
derived from togaviruses (with or without a lipid bilayer and glycoprotein complement) 
can be generated from a wide variety of togaviruses. including, but not limited to, 
alphaviruses (such as Sindbis Virus (e.g., SIN-1 or wild-type Sindbis virus), 

25 Venezuelan Equine Encephalitis virus, Ross River virus. Eastern Equine Encephalitis 
virus. Western Equine Encephalitis virus, and rubiviruses (e.g., rubella), 

In a second embodiment, sequences from heterologous viruses which 
encode peptides that bind to genomic viral RNA can be insened into a defective helper 
(DH) expression cassette in the amino terminal region of the alphavirus capsid gene, 

30 which has been deleted of the sequences which encode the region of the protein that 
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binds to the homologous alphavirus genomic RNA. For example. BHK cells can be 
electroporated with an alphavirus replicon RNA, a DH RNA containing a sequence that 
encodes a heterologous virus genomic RNA binding peptide, and a replicon derived 
from the same heterologous virus. Thus, the alphavirus particles produced contain a 
5 genomic RNA from a heterologous virus, and possess the host-range tropism of the 
alphavirus. As one possible example, gag sequences encoding proteins required for 
retrovirus RNA binding are included in the DH expression cassette construct. In this 
configuration, the resulting alphavirus particles would contain retrovirus vector RNA. 

10 F. Heterologous Sequences 

As noted above, a wide variety of nucleotide sequences may be carried 
and expressed by the gene delivery vehicles of the present invention. Preferably, the 
nucleotide sequences should be of a size sufficient to allow production of viable virus. 
Within the context of the present invention, the production of any measurable titer by 

15 recombinant alphavirus particles, for example, by plaque assay, iuciferase assay, or 
P-galactosidase assay of infectious virus on appropriate susceptible monolayers, or the 
expression of detectable levels of the heterologous gene product by RNA or DNA 
vectors, is considered to be "production of viable virus." This may be, at a minimum, 
an alphavirus vector construct which does not contain any additional heterologous 

20 sequence. However, within other embodiments, the vector construct may contain 
additional heterologous or foreign sequences. Within preferred embodiments, the 
heterologous sequence can comprise a heterologous sequence of at least about 100 
bases, 2 kb, 3.5 kb ; 5 kb, 7 kb, or even a heterologous sequence of at least about 8 kb. 

As will be evident to one of ordinary skill in the art given the disclosure 

25 provided herein, the efficiency of recombinant alphavirus particle packaging and hence, 
viral titer, is to some degree dependent upon the size of the sequence to be packaged. 
Thus, in order to increase the efficiency of packaging and the production of viable virus, 
additional non-coding sequences may be added to the vector construct. Moreover, 
within certain embodiments of the invention it may be desired to increase or decrease 

30 viral titer. This increase or decrease may be accomplished by increasing or decreasing 
the size of the heterologous sequence, and hence the efficiency of packaging. 
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As briefly noted above, a wide variety of heterologous sequences may be 
included within the gene delivery vehicles described herein including, for example, 
sequences which encode palliatives such as lymphokines or cytokines, toxins, prodrug 
converting enzyme, antigens which stimulate an immune response, ribozymes, proteins 
5 for therapeutic application such as growth or regulatory factors, and proteins which 
assist or inhibit an immune response, as well as antisense sequences (or sense sequences 
for "antisense applications"). In addition, as discussed above, the gene delivery vehicles 
provided herein may contain (and express, within certain embodiments) two or more 
heterologous sequences. 

10 

1. Lvmphokines 

Within one embodiment of the invention, the heterologous sequence 
encodes a lymphokine. Briefly, lymphokines act to proliferate, activate, or differentiate 
immune effectors cells. Representative examples of lymphokines include gamma 

15 interferon, tumor necrosis factor, 1L-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, 
IL-10, IL-1 1, IL-12. IL-13, IL-14, IL-15, GM-CSF, CSF-1 and G-CSF. 

Within related embodiments of the invention, the heterologous sequence 
encodes an immunomodulatory cofactor. Briefly, as utilized within the context of the 
present invention, "immunomodulatory cofactor" refers to factors which, when 

20 manufactured by one or more of the cells involved in an immune response, or when 
added exoeenouslv to the cells, causes the immune response to be different in quality or 
potency from that which would have occurred in the absence of the cofactor. The 
quality or potency of a response may be measured by a variety of assays known to one 
of skill in the an including, for example, in vitro assays which measure cellular 

25 proliferation {e.g., 3 H thymidine uptake), and in vitro cytotoxic assays {e.g., which 
measure i[ Cr release) {see Wamer et al., AIDS Res. and Human Retroviruses 7:645-655, 
1991). 

Representative examples of immunomodulatory co-factors include alpha 
interferon (Finter et al., Drugs 4?(5):749-765, 1991; U.S. Patent No. 4,892,743; U.S. 
30 Patent No. 4,966.843; WO 85/02862; Nagata et al., Nature 2^:316-320, 1980; 
Familletti et al., Methods in Enz. 75:387-394, 1981; Twu et al., Proc. Natl. Acad. Sci. 
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USA 56:2046-2050. 1989; Faktor etal., Oncogene 5:867-872, 1990), beta interferon 
(Scif etal., J. Virol. 65:664-671. 1991), gamma interferons (Radford etal., American 
Society of Hepatoiogy:2QQ&-2Q\5, 1991; Watanabe etal., PNAS 56:9456-9460, 1989; 
Gansbacher etal.. Cancer Research 50: 7820-7825, 1990; Maio etal., Can. Immunol. 
5 Immunother. 50:34-42, 1989; U.S. Patent Nos. 4,762,791 and 4,727.138), G-CSF (U.S. 
Patent Nos. 4,999.291 and 4,810,643), GM-CSF (WO 85/04188), TNFs (Jayaraman 
etal., J. Immunology 144:942-951. 1990), mterleukin- 2 (IL-2) (Karupiah etal., J. 
Immunology W:290-298. 1990; Weber etal., J. Exp. Med. 766:1716-1733, 1987; 
Gansbacher etal.. y. Exp. Med. 772:1217-1224, 1990; U.S. Patent No. 4,738.927), IL-4 

10 (Tepper etal., Cell 57:503-512, 1989; Golumbek etal, Science 254:713-716, 1991: 
U.S. Patent No. 5.017,691). IL-6 (Brakenhof et al., J. Immunol. 759:41 16-4121, 1987; 
WO 90/06370), IL-12. IL-15 (Grabstein etal., Science 264:965-968, 1994; Genbank- 
EMBL Accession No. V03099), ICAM-1 fAltman etal.. Nature 555:512-514, 1989), 
ICAM-2, LFA-1. LFA-3. MHC class 1 molecules, MHC class II molecules, 

15 ^microglobulin, chaperones. CD3, B7/BB1, MHC linked transporter proteins or 
analogues thereof. 

The choice of which immunomodulatory cofactor to include within a 
alphavirus vector construct may be based upon known therapeutic effects of the 
cofactor. or experimentally determined. For example, in chronic hepatitis B infections 

20 alpha interferon has been found to be efficacious in compensating a patient's 
immunological deficit and thereby assisting recover)' from the disease. Alternatively, a 
suitable immunomodulatory cofactor may be experimentally determined. Briefly, blood 
samples are first taken from patients with a hepatic disease. Peripheral blood 
lymphocytes (PBLs) are restimulated in vitro with autologous or HLA-matched cells 

25 (e.g., EBV transformed cells), and transduced with an alphavirus vector construct which 
directs the expression of an immunogenic portion of a hepatitis antigen and the 
immunomodulatory cofactor. Stimulated PBLs are used as effectors in a CTL assay 
with the HLA-matched transduced cells as targets. An increase in CTL response over 
that seen in the same assay performed using HLA-matched stimulator and target cells 

30 transduced with a vector encoding the antigen alone, indicates a useful 
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immunomodulatory cofactor. Within one embodiment of the invention, the 
irrimunomodulatory cofactor gamma interferon is particularly preferred. 

Another example of an immunomodulatory cofactor is the B7/BB1 
costimuiatory factor. Briefly, activation of the full functional activity of T cells requires 
5 two signals. One signal is provided by interaction of the antigen-specific T cell receptor 
with peptides which are bound to major histocompatibility complex (MHC) molecules, 
and the second signal, referred to as costimulation, is delivered to the T cell by antigen- 
presenting cells. The second signal is required for interleukin-2 (IL-2) production by 
T cells and appears to involve interaction of the B7/BB1 molecule on antigen- 
ic" presenting cells with CD28 and CTLA-4 receptors on T lymphocytes (Linsley et al., J. 
Exp. Med. 775:721-730, 1991a. and J. Exp. Med. 174:561-510, 1991). Within one 
embodiment of the invention, B7/BB1 may be introduced into tumor cells in order to 
cause costimulation of CDS' T cells, such that the CD8 T T cells produce enough IL-2 to 
expand and become fully activated. These CD8" T cells can kill tumor cells that are not 
15 expressing B7 because costimulation is no longer required for further CTL function. 
Vectors that express both the costimuiatory B7/BB1 factor and, for example, an 
immunogenic HBV core protein, may be made utilizing methods which are described 
herein. Cells transduced with these vectors will become more effective antigen- 
presenting cells. The HBV core-specific CTL response will be augmented from the 
20 fully activated CDS" T cell via the costimuiatory ligand B7/BB 1 . 

2. Toxins 

Within another embodiment of the invention, the heterologous sequence 
encodes a toxin. Briefly, toxins act to directly inhibit the growth of a cell. 

25 Representative examples of toxins include ricin (Lamb et al., Eur, J. Biochem. 148:265- 
270, 1985), abrin (Wood et al., Eur. J. Biochem. 198:123-132, 1991; Evensen el al., 
J. of Biol. Chem. 2(56:6848-6852, 1991; Collins etal., J. of Biol. Chem. 265:8665-8669, 
1990; Chen et al., Fed. of Eur. Biochem Soc. 509:115-118, 1992), diphtheria toxin 
(Tweten et al., J. Biol. Chem. 260:10392-10394, 1985), cholera toxin (Mekalanos et al., 

30 Nature 506:551-557, 1983; Sanchez and Holmgren, PNAS 56:481-485, 1989), gelonin 
(Stirpe etaL J. Biol. Chem. 255:6947-6953, 1980), pokeweed (Irvin, Pharmac. Ther. 
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27:371-387. 1983), antiviral protein (Barbien et al, Biochem. J. 203:55-59. 1982; Irvin 
et at., Arch. Biochem. & Biophys. 200:418-425. 1980; Irvin, Arch. Biochem. & Biophys. 
7(59:522-528, 1975). tritin. Shigella toxin (Calderwood et al.. PNAS £4:4364-4368, 
1987; Jackson et al.. Microb. Path. 2:147-153, 1987), Pseudomonas exotoxin A (Carroll 
5 and Collier, J. Biol. Chem. 262:8707-8711, 1987), herpes simplex virus thymidine 
kinase (HSVTK) (Field et aL J. Gen. Virol. 49:115-124, 1980), and E. coli. guanine 
phosphoribosyl transferase. 

3. Prodrug convening enzymes 

10 Within other embodiments of the invention, the heterologous sequence 

encodes a prodrug converting enzyme. Briefly, as utilized within the context of the 
present invention, a prodrug convening enzyme refers to a gene product that activates a 
compound with little or no cytotoxicity into a toxic product (the prodrug). 
Representative examples of such gene products include HSVTK and VZVTK (as well 

15 as analogues and derivatives thereof), which selectively monophosphorylate cenain 
purine arabinosides and substituted pyrimidine compounds, convening them to 
cytotoxic or cytostatic metabolites. More specifically, exposure of the drugs 
ganciclovir, acyclovir, or any of their analogues (e.g., FIAU. FIAC, DHPG) to HSVTK 
phosphorylates the drug into its corresponding active nucleotide triphosphate form. 

20 Representative examples of other prodrug convening enzymes which can 

also be utilized within the context of the present invention include: E. coli guanine 
phosphoribosyl transferase which convens thioxanthine into toxic thioxanthine 
monophosphate (Besnard et ah, Mol. Cell Biol. 7:4139-4141, 1987); alkaline 
phosphatase, which convens inactive phosphorylated compounds such as mitomycin 

25 phosphate and doxorubicin-phosphate into toxic dephosphorylated compounds; fungal 
(e.g., Fusarium oxysporam) or bacterial cytosine deaminase, which convens 5- 
fluorocytosine to the toxic compound 5-fluorouracil (Mullen, PNAS 59:33, 1992); 
carboxypeptidase G2, which cleaves the glutamic acid from para-N-bis (2-chloroethyl) 
aminobenzoyl glutamic acid, thereby creating a toxic benzoic acid mustard; and 

30 Penicillin-V amidase. which converts phenoxyacetabide derivatives of doxorubicin and 
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melphalan 10 toxic compounds (see generally. Vrudhula et aL J. of Med. Chem. 
5tf(7):919-923, 1993; Kem et al.. Cane. Immun. Immunother. i/(4):202-206. 1990). 

4. Antisense Sequences 

5 Within another embodiment of the invention, the heterologous sequence 

is an antisense sequence. Briefly, antisense sequences are designed to bind to RNA 
transcripts, and thereby prevent cellular synthesis of a particular protein or prevent use 
of that RNA sequence by the cell. Representative examples of such sequences include 
antisense thymidine kinase, antisense dihydrofolate reductase (Maher and Dolnick, 

10 Arch. Biochem. & Biophys. -'53:214-220, 1987; Bzik et aL, PNAS 54:8360-8364, 1987), 
antisense HER2 (Coussens ei aL Science 250:1132-1 139, 1985), antisense ABL 
(Fainstein et al.. Oncogene 4:1477-1481, 1989), antisense Myc (Stanton et al., Nature 
3/0:423-425, 1984) and antisense ras, as well as antisense sequences which block any 
of the cell cycle signaling components (e.g., cyclins, cyclin-dependent kinases, cyclin- 

!5 dependent kinase inhibitors) or enzymes in the nucleotide biosynthetic pathway. In 
addition, within other embodiments of the invention antisense sequences to interferon 
and 2 microglobulin may be utilized in order to decrease immune response. 

In addition, within a further embodiment of the invention, antisense 
RNA mav be utilized as an anti-tumor agent in order to induce a potent Class 1 

20 restricted response. Briefly., in addition to binding RNA and thereby preventing 
translation of a specific mRNA, high levels of specific antisense sequences are believed 
to induce the increased expression of interferons (including gamma-interferon) due to 
the formation of large quantities of double-stranded RNA. The increased expression of 
gamma interferon, in turn, boosts the expression of MHC Class 1 antigens. Preferred 

25 antisense sequences for use in this regard include actin RNA, myosin RNA, and histone 
RNA. Antisense RNA which forms a mismatch with actin RNA is particularly 
preferred. 

5. Ribozymes 

30 Within other aspects of the present invention, gene delivery vehicles are 

provided which produce ribozymes upon infection of a host cell. Briefly, ribozymes are 
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RNA sequence. Generally, the substrate binding sequence of a ribozyme is between 10 
and 20 nucleotides long. The length of this sequence is sufficient to allow a 
hybridization with target RNA and disassociation of the ribozyme from the cleaved 
5 RNA. 

A wide variety of ribozymes may be utilized within the context of the 
present invention, including for example. Group I intron ribozymes (Cech et al., U.S. 
Patent No. 4,987.071); hairpin ribozymes (Hampel et al., NucL Acids Res. 7(5:299-304, 
1990, U.S. Patent No. 5.254.678 and European Patent Publication No. 0 360 257), 

10 hammerhead ribozvmes (Rossi. J.J. et al.. Phannac. Ther. i0:245-254, 1991; Forster 
and Symons. Cell 48:21 1-220. 1987; Haseloff and Gerlach, Nature 328:596-600. 1988; 
Walbot and Bruening, Nature 334A96, 1988; Haseloff and Gerlach. Nature 534:585, 
1988). hepatitis delta virus ribozymes (Perrotta and Been. Biochem. 31:\6, 1992); 
RNase P ribozymes (Takada et al., Cell 35:849, 1983); as well as other types of 

15 ribozymes (see e.g., WO 95/29241. and WO 95/31551). Further examples of ribozymes 
include those described in U.S. Patent Nos. 5.1 16,742, 5,225,337 and 5,246,921. 

6. Proteins and other cellular constituents 

Within other aspects of the present invention, a wide variety of proteins 
20 or other cellular constituents may be carried and/or expressed by the gene delivery 
vehicles provided herein. Representative examples of such proteins include native or 
altered cellular components, as well as foreign proteins or cellular constituents, found in 
for example, viruses, bacteria, parasites or fungus. 

25 a. Altered Cellular Components 

Within one embodiment, gene delivery vehicles are provided which 
direct the expression of an immunogenic, non-rumorigemc, altered cellular component 
{see, e.g., WO 93/10814). As utilized herein, the term "immunogenic" refers to altered 
cellular components which are capable, under the appropriate conditions, of causing an 

30 immune response. This response must be cell-mediated, and may also include a 
humoral response. The term "non-tumorigenic" refers to altered cellular components 
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which will not cause cellular transformation or induce tumor formation in nude mice. 
The phrase "altered cellular component" refers to proteins and other cellular 
constituents which are either associated with rendering a cell tumorigenic. or are 
associated with tumorigenic cells in general, but are not required or essential for 
5 rendering the cell tumorigenic. 

Briefly, before alteration, the cellular components may be essential to 
normal ceil growth and regulation and include, for example, proteins which regulate 
intracellular protein degradation, transcriptional regulation, cell-cycle control, and cell- 
cell interaction. After alteration, the cellular components no longer perform their 
0 regulatory functions and. hence, the cell may experience uncontrolled growth. 
Representative examples of altered cellular components include ras\ p53'. Rb\ altered 
protein encoded by the Wilms 1 tumor gene, ubiquitin*, mucin*, protein encoded by the 
DCC, APC, and MCC genes, the breast cancer gene BRCAl", as well as receptors or 
receptor-like structures such as neu, thyroid hormone receptor, platelet derived growth 
5 factor (PDGF) receptor, insulin receptor, epidermal growth factor (EGF) receptor, and 
the colony stimulating factor (CSF) receptor. 

Once a sequence encoding the altered cellular component has been 
obtained, it is necessary to ensure that the sequence encodes a non-tumorigenic protein. 
Various assays which assess the tumorigenicity of a particular cellular component are 
known and may easily be accomplished. Representative assays include a rat fibroblast 
assay, tumor formation in nude mice or rats, colony formation in soft agar, and 
preparation of transgenic animals, such as transgenic mice. 

Tumor formation in nude mice or rats is a particularly important and 
sensitive method for determining the tumorigenicity of a particular cellular component. 
Nude mice lack a functional cellular immune system (i.e., do not possess CTLs), and 
therefore provide a useful in vivo model in which to test the tumorigenic potential of 
cells. Normal non-tumorigenic cells do not display uncontrolled growth properties if 
infected into nude mice. However, transformed cells will rapidly proliferate and 
generate tumors in nude mice. Briefly, in one embodiment an alphavirus vector 
construct is administered to syngeneic murine cells, followed by injection into nude 
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mice. The mice are visually examined for a period of 2 to 8 weeks after injection in 
order to determine tumor growth. The mice may also be sacrificed and autopsied in 
order to determine whether tumors are present. (Giovanella et al.. J. Nail. Cancer Insi. 
48: 153 1 -1533, 1972; Furesz et al.. Abnormal Cells, New Products and Risk, Hopps and 
5 Petricciani (eds.), Tissue Culture Association. 1985; and Levenbook et ah, J. Biol. Std. 
/5:I35-14i. 1985.) 

Tumorigenicity may also be assessed by visualizing colony formation in 
soft agar (Macpherson and Montagnier. Virol. 23:29 1-294, 1964). Briefly, one property 
of normal non-tumorisenic cells is "contact inhibition" (i.e.. cells will stop proliferating 
10 when thev touch neighboring cells). If cells are plated in a semi-solid agar support 
medium, normal ceils rapidly become contact inhibited and stop proliferating, whereas 
tumorigenic cells will continue to proliferate and form colonies in soft agar. 

If the altered cellular component is associated with making the cell 
tumorigenic, then it is necessary to make the altered cellular component non- 
15 tumorigenic. For example, within one embodiment the sequence or gene of interest 
which encodes the altered cellular component is truncated in order to render the gene 
product non-tumorigenic. The gene encoding the altered cellular component may be 
truncated to a variety of sizes, although it is preferable to retain as much as possible of 
the altered cellular component. In addition, it is necessary that any truncation leave 
20 intact at least some of the immunogenic sequences of the altered cellular component. 
Alternatively, multiple translational termination codons may be introduced downstream 
of the immunogenic region. Insertion of termination codons will prematurely terminate 
protein expression, thus preventing expression of the transforming portion of the 
protein. 

25 As noted above, in order to generate an appropriate immune response, 

the altered cellular component must also be immunogenic. Immunogenicity of a 
particular sequence is often difficult to predict, although T cell epitopes often possess an 
immunogenic amphipathic alpha-helix component. In general, however, it is preferable 
to determine immunogenicity in an assay. Representative assays include an ELISA, 

30 which detects the presence of antibodies against the newly introduced vector, as well as 
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assays which test tor T helper cells such as gamma-interferon assays, IL-2 production 
assays, and proliferation assays. 

As noted above, within another aspect of the present invention, several 
different altered cellular components may be co-expressed in order to form a general 

5 anti-cancer therapeutic. Generally, it will be evident to one of ordinary skill in the an 
that a variety of combinations can be made. Within preferred embodiments, this 
therapeutic may be targeted to a particular type of cancer. For example, nearly all colon 
cancers possess mutations in ras, p53. DCC APC or MCC genes. An alphavirus vector 
construct which co-expresses a number of these altered cellular components may be 

10 administered to a patient with colon cancer in order to treat all possible mutations. This 
methodology may also be utilized to treat other cancers. Thus, an alphavirus vector 
construct which co-expresses mucin', ras". neu. BRCA1* and p53* may be utilized to 
treat breast cancer. 

15 b. Antigens from foreign organisms or other pathogens 

Within other aspects of the present invention, vectors are provided which 
direct the expression of immunogenic portions of antigens from foreign organisms or 
other pathogens. Representative examples of such antigens include bacterial antigens 
(e.g., E. coli, streptococcal, staphylococcal, mycobacterial, etc.), fungal antigens. 

20 parasitic antigens, and viral antigens {e.g., influenza virus. Feline Leukemia Virus 
("FeLV"), immunodeficiency viruses such as Feline Immunodeficiency Virus ("FIV") 
or Human Immmunodeficiency Virus ("HIV"), Hepatitis A, B and C Virus ("HAV'\ 
"HBV" and "HCV", respectively), Respiratory Syncytial Virus, Human Papiioma Virus 
("HPV"), Epstein-Barr Virus ("EBV"), Herpes Simplex Virus ("HSV"), Hantavirus, 

25 HTLV I, HTLV II and Cytomegalovirus ("CMV"). As utilized within the context of the 
present invention, "immunogenic portion" refers to a portion of the respective antigen 
which is capable, under the appropriate conditions, of causing an immune response {i.e., 
cell-mediated or humoral). "Portions" may be of variable size, but are preferably at 
least 9 amino acids long, and may include the entire antigen. Cell-mediated immune 

30 responses may be mediated through Major Histocompatability Complex ("MHC") class 
I presentation, MHC Class II presentation, or both. 
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Within one aspect of the invention, alphavirus vector constructs are 
provided which direct the expression of immunogenic portions of Hepatitis B antigens 
(see, e.g., WO 93/15207). The Hepatitis B virus presents several different antigens, 
including among others, three HB "S" antigens (HBsAgs), an HBc antigen (HBcAg), an 
5 HBe antigen (HBeAg), and an HBx antigen (HBxAg) (see Blum et aL TIG 5(5): 154- 
158, 1989). Briefly, the HBeAg results from proteolytic cleavage of a P22 pre-core 
intermediate and is secreted from the cell. HBeAg is found in serum as a 1 7 kD protein. 
The HBcAg is a protein of 183 amino acids, and the HBxAg is a protein of 145 to 154 
amino acids, depending on subtype. 

10 The HBsAgs (designated "large," "middle" and "small") are encoded by 

three regions of the Hepatitis B genome: S, pre-S2 and pre-Sl. The large protein, 
which has a length varying from 389 to 400 amino acids, is encoded by pre-Sl, pre-S2. 
and S regions, and is found in glycosylated and non-glycosylated forms. The middle 
protein is 281 amino acids long and is encoded by the pre-S2 and S regions. The small 

15 protein is 226 amino acids long and is encoded by the S region. It exists in two forms, 
glycosylated (GP 27 s ) and non-glycosylated (P24 s ). If each of these regions are 
expressed separately, the pre-Sl region will code for a protein of approximately 119 
amino acids, the pre-S2 region will code for a protein of approximately 55 amino acids, 
and the S region will code for a protein of approximately 226 amino acids. 

20 As will be evident to one of ordinary skill in the art, various 

immunogenic portions of the above-described S antigens may be combined in order to 
induce an immune response when administered by one of the alphavirus vector 
constructs described herein. In addition, due to the large immunological variability that 
is found in different geographic regions for the S open reading frame of HBV, particular 

25 combinations of antigens may be preferred for administration in particular geographic 
regions. 

Also presented by HBV are pol ("HBV pot), ORF 5, and ORF 6 
antigens. Briefly, the polymerase open reading frame of HBV encodes reverse 
transcriptase activity found in virions and core-like particles in infected livers. The 
30 polymerase protein consists of at least two domains: the amino terminal domain which 
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encodes the protein that primes reverse transcription, and the carboxy] terminal domain 
which encodes reverse transcriptase and RNase H activity. Immunogenic portions of 
HBV pol may be determined utilizing methods described herein, utilizing alphavirus 
vector constructs described below, and administered in order to generate an immune 
5 response within a warm-blooded animal. Similarly, other HBV antigens, such as ORF 
5 and ORF 6 (Miller et al., Hepatology 9:322-327, 1989) may be expressed utilizing 
alphavirus vector constructs as described herein. 

As noted above, at least one immunogenic portion of an antigen from a 
foreign organism is incorporated into a gene delivery vehicle. The immunogenic 

10 portion(s) which are incorporated into the gene delivery vehicles may be of varying 
length, although it is generally preferred that the portions be at least 9 amino acids long 
and may include the entire antigen. Immunogenicity of a particular sequence is often 
difficult to predict, although T cell epitopes may be predicted utilizing computer 
algorithms such as TSITES (Medlmmune, Maryland), in order to scan coding regions 

15 for potential T-helper sites and CTL sites. From this analysis, peptides are synthesized 
and used as targets in an in vitro cytotoxic assay. Other assays, however, may also be 
utilized, including, for example, ELISA. which detects the presence of antibodies 
aeainst the newly introduced vector, as well as assays which test for T helper cells, such 
as gamma-interferon assays. IL-2 production assays and proliferation assays. 

20 Immunogenic portions may also be selected by other methods. For 

example, the HLA A2.1 transgenic mouse has been shown to be useful as a model for 
human T-cell recognition of viral antigens. Briefly, in the influenza and hepatitis B 
viral systems, the murine T cell receptor repertoire recognizes the same antigenic 
determinants recognized by human T cells. In both systems, the CTL response 

25 generated in the HLA A2.1 transgenic mouse is directed toward virtually the same 
epitope as those recognized by human CTLs of the HLA A2.1 haplotype (Vitiello et al., 
J.Exp. Med. /7J:1007-1015, 1991; Vitiello et al., Abstract of Molecular Biology of 
Hepatitis B Virus Symposia, 1992). 

As noted above, more than one immunogenic portion may be 

30 incorporated into the gene delivery vehicles. For example, a gene delivery vehicle may 
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express (either separately or as one construct) all or immunogenic portions of HBcAg, 
HBeAg, HBsAgs, HBxAg, as well as immunogenic portions of the HCV antigens C, 
EI, E2. NS3.NS4, orNS5. 

5 1. Sources for Heterologous Sequences 

Sequences which encode the above-descnbed proteins may be readily 
obtained from a variety of sources, including for example, depositories such as the 
.American Type Culture Collection (ATCC, Rockville, MD), or from commercial 
sources such as British Bio-Technology Limited (Cowley, Oxford, England). 

10 Representative examples include BBG 12 (containing the GM-CSF gene coding for the 
mature protein of 127 amino acids); BBG 6 (which contains sequences encoding 
gamma interferon), ATCC No. 39656 (which contains sequences encoding TNF), 
ATCC No. 20663 (which contain sequences encoding alpha interferon), ATCC Nos. 
31902, 31902 and 39517 (which contains sequences encoding beta interferon), ATCC 

15 No 67024 (which contain a sequence which encodes Interieukm-lb); ATCC Nos. 
39405, 39452. 39516, 39626 and 39673 (which contains sequences encoding 
Interieukin-2); ATCC Nos. 59399. 59398, and 67326 (which contain sequences 
encoding interleukin-3); ATCC No. 57592 (which contains sequences encoding 
Interleukin-4), ATCC Nos. 59394 and 59395 (which contain sequences encoding 

20 Interleukin-5), and ATCC No. 67153 (which contains sequences encoding Interleukin- 
6). 

Sequences which encode altered cellular components as described above 
may be readily obtained from a variety of sources. For example, plasmids which 
contain sequences that encode altered cellular products may be obtained from a 

25 depository such as the American Type Culture Collection (ATCC. Rockville, MD), or 
from commercial sources such as Advanced Biotechnologies (Columbia, Maryland). 
Representative examples of plasmids containing some of the above-described sequences 
include ATCC No. 41000 (containing a G to T mutation in the 12th codon of ras), and 
ATCC No. 41049 (containing a G to A mutation in the 12th codon). 

30 Alternatively, plasmids which encode normal cellular components may 

also be obtained from depositories such as the ATCC (see, for example, ATCC No. 
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41001, which contains a sequence which encodes the normal ras protein; ATCC No. 
57103, which encodes abi; and ATCC Nos. 59120 or 59121, which encode the bcr 
locus) and mutated to form the altered cellular component. Methods for mutagenizing 
particular sites may readily be accomplished using methods known in the an (see 
5 Sambrook et al.. supra., 15.3 et seq.). In particular, point mutations of normal cellular 
components such as ras may readily be accomplished by site-directed mutagenesis of 
the particular codon. for example, codons 12, 13 or 6 1 . 

Sequences which encode the above-described viral antigens may 
likewise be obtained from a variety of sources. For example, molecularly viral cloned 

10 genes may be obtained from sources such as the American Type Culture Collection 
(ATCC, Rockville. MD). For example, ATCC No. 45020 contains the total genomic 
DNA of hepatitis B (extracted from purified Dane panicles) {see Figure 3 of Blum 
et al.. TIC J(5):154-158, 1989) in the Bam HI site of pBR322 (Moriarty et al., Proc. 
Nail. Acad. Sci. USA 75:2606-2610, 1981). 

15 Alternatively, cDNA sequences which encode the above-described 

heterologous sequences may be obtained from cells which express or contain the 
sequences. Briefly, within one embodiment, mRNA from a cell which expresses the 
gene of interest is reverse transcribed with reverse transcriptase using oligonucleotide 
dT or random pnmers. The single stranded cDNA may then be amplified by PCR (see 

20 U.S. Patent Nos. 4.683.202; 4.683.195 and 4,800,159. See also PCR Technology: 
Principles and Applications for DNA Amplification, Erlich (ed.), Stockton Press, 1989) 
utilizing oligonucleotide primers complementary to sequences on either side of desired 
sequences. In particular, a double-stranded DNA is denatured by heating in the 
presence of heat stable Taq polymerase, sequence-specific DNA primers, dATP, dCTP, 

25 dGTP and dTTP. Double-stranded DNA is produced when synthesis is complete. This 
cycle may be repeated many times, resulting in a factorial amplification of the desired 
DNA. 

Sequences which encode the above-described proteins may also be 
synthesized, for example, on an Applied Biosystems Inc. DNA synthesizer {e.g., APB 
30 DNA synthesizer model 392 (Foster City, CA)). 
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G. Alphavirus Packaging / Producer Cell Lines 

Within further aspects of the invention, alphavirus packaging and 
producer cell lines are provided. In particular, within one aspect of the present 
5 invention, alphavirus packaging ceil lines are provided wherein the virai structural 
proteins are supplied in trans from one or more stably transformed expression vectors, 
and are able to encapsidate transfected, transduced, or intracellularly produced vector 
RNA transcripts in the cytoplasm and release infectious packaged vector panicles 
through the cell membrane. In preferred embodiments, the structural proteins necessary 

10 for packaging are synthesized at high levels only after induction by the RNA vector 
replicon itself or some other provided stimulus, and the transcripts encoding these 
structural proteins are capable of cytoplasmic amplification in a manner that will allow 
expression levels sufficient to mimic that of a natural viral infection. Furthermore, in 
other embodiments, expression of a selectable marker is operably linked to the 

15 structural protein expression cassette. Such a linked selectable marker allows efficient 
generation of functional, stably transformed PCL. 

For example, alphavirus RNA vector replicon molecules of the desired 
phenotype to be packaged, which are themselves capable of autocataiytic replication in 
the cell cytoplasm, can be introduced into the packaging cells as in vitro transcribed 

20 RNA. recombinant alphavirus panicles, or as alphavirus cDNA vector constructs. The 
RNA vector transcripts then replicate to high levels, stimulate amplification of the 
structural protein gene transcript(s) and subsequent protein expression, and are 
subsequently packaged by the viral structural proteins, yielding infectious vector 
panicles. The intracellular expression of alphavirus proteins and/or vector RNA above 

25 certain levels may result in cytotoxic effects in packaging or producer cell lines. 
Therefore, within cenain embodiments of the invention, it may be desirable for these 
elements to be derived from virus variants selected for reduced cytotoxicity of their 
expressed structural proteins, reduced inhibition of host macromolecuiar synthesis, 
and/or the ability to establish persistent infection. 

30 To optimize vector packaging cell line performance and final vector titer, 

successive cycles of gene transfer and vector packaging may be performed. For 
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example, supernatams containing infectious packaged vector panicles derived from 
vector transfection of the packaging cell lines, can be used to infect or "transduce" a 
fresh monolayer or suspension culture of alphavirus packaging cells. Successive 
transductions with packaged vector particles and fresh packaging cells may be preferred 
5 over nucleic acid transfection because of its higher RNA transfer efficiency into cells, 
optimized biological placement of the vector in the cell, and ability to "scale-up" the 
process for vector production from increasingly larger numbers of packaging cells. This 
leads to higher expression and higher titer of packaged infectious recombinant 
alphavirus vector. 

10 Within other aspects of the invention, a stably integrated or episomally 

maintained DNA expression vector can be used to produce the alphavirus vector RNA 
molecule within the cell. Briefly, such a DNA expression vector can be configured, in 
preferred embodiments, to be inducible, such that trancription of the alphavirus vector 
RNA occurs only when cells have been propagated to a desired density, and are 

15 subsequently induced. Once transcribed, the alphavirus vector maintains the ability to 
self-replicate autocatalytically and triggers a cascade of events that culminate in 
packaged vector particle production. This approach allows for continued vector 
expression over extended periods of culturing because the integrated DNA vector 
expression system is maintained through a drug or other selection marker and the DNA 

20 system, once induced, will constitutively express unaltered RNA vector replicons which 
cannot be diluted out by defective RNA copies. Production of larger-scale, high titer 
packaged alphavirus vector is possible in this alphavirus "producer cell line" 
configuration, the DNA-based alphavirus vector is introduced initially into the 
packaging cell line by transfection, since size restrictions could prevent packaging of 

25 the expression vector into a viral vector particle for transduction. 

H. Pharmaceut ical Compositions 

As noted above, the present invention also provides pharmaceutical 
compositions comprising the gene delivery vehicles described herein in combination 
30 with a pharmaceutically acceptable carrier, diluent, or recipient. For example, within 
one embodiment, RNA or DNA vector constructs of the present invention can be 
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lyophilized for long term storage and transport, and may be reconstituted prior to 
administration using a variety of substances, but are preferably reconstituted using 
water. In certain instances, dilute salt solutions which bring the final formulation to 
isotonicity may also be used. In addition, it may be advantageous to use aqueous 
5 solutions containing components which enhance the activity or physically protect the 
reconstituted nucleic acid preparation. Such components include cytokines, such as IL- 
2, polycations, such as protamine sulfate, lipid formulations, or other components. 
Lyophilized or dehydrated recombinant vectors may be reconstituted with any 
convenient volume of water or the reconstituting agents noted above that allow 

1 0 substantial, and preferably total solubilization of the lyophilized or dehydrated sample. 

Recombinant alphavirus panicles or infectious recombinant virus (both 
referred to as virus below) may be preserved either in crude or purified forms. In order 
to produce virus in a crude form, producing cells may first be cultivated in a bioreactor 
or flat stock culture, wherein viral panicles are released from the cells into the culture 

15 media. Virus may then be preserved in crude form by first adding a sufficient amount 
of a formulation buffer to the culture media containing the recombinant virus to form an 
aqueous suspension. Within cenain preferred embodiments, the formulation buffer is 
an aqueous solution that contains a saccharide, a high molecular weight structural 
additive, and a buffering component in water. The aqueous solution may also contain 

20 one or more amino acids. 

The recombinant virus can also be preserved in a purified form. More 
specifically, prior to the addition of the formulation buffer, the crude recombinant virus 
described above may be clarified by passing it through a filter and then concentrated, 
such as by a cross flow concentrating system (Filtron Technology Corp., Nortborough, 

25 MA). Within one embodiment, DNase is added to the concentrate to digest exogenous 
DNA. The digest is then diafiltrated in order to remove excess media components and 
to establish the recombinant virus in a more desirable buffered solution. The diafiltrate 
is then passed over a Sephadex S-500 gel column and a purified recombinant virus is 
eluted. A sufficient amount of formulation buffer is then added to this eluate in order to 

30 reach a desired final concentration of the constituents and to minimally dilute the 
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recombinant virus. The aqueous suspension may then be stored, preferably at -70°C. or 
immediately dried. As above, the formulation buffer may be an aqueous solution that 
contains a saccharide, a high molecular weight structural additive, and a buffering 
component in water. The aqueous solution may also contain one or more amino acids. 
5 Crude recombinant virus may also be purified by ion exchange column 

chromatography. Briefly, crude recombinant virus may be clarified by first passing it 
through a filter, followed by loading the filtrate onto a column containing a highly 
sulfonated cellulose matrix. The recombinant virus may then be eluted from the column 
in purified form by using a high salt buffer, and the high salt buffer exchanged for a 

10 more desirable buffer by passing the eluate over a molecular exclusion column. A. 
sufficient amount of formulation buffer is then added, as discussed above, to the 
purified recombinant virus and the aqueous suspension is either dried immediately or 
stored, preferably at -70°C. 

The aqueous suspension in crude or purified form can be dried by 

15 lyophilization or evaporation at ambient temperature. Briefly, lyophilization involves 
the steps of cooling the aqueous suspension below the gas transition temperature or 
below the eutectic point temperature of the aqueous suspension, and removing water 
from the cooled suspension by sublimation to form a lyophilized virus. Within one 
embodiment, aliquots of the formulated recombinant virus are placed into an Edwards 

20 Refrigerated Chamber (3 shelf RC3S unit) attached to a freeze dryer (Supermodulyo 
12K). A multistep freeze drying procedure as described by Phillips et al. (Cryobiology 
75:414, 1981) is used to lyophilize the formulated recombinant virus, preferably from a 
temperature of -40°C to -45°C. The resulting composition contains less than 10% water 
by weight of the lyophilized virus. Once lyophilized, the recombinant virus is stable 

25 and may be stored at -20°C to 25°C, as discussed in more detail below. 

Within the evaporative method, water is removed from the aqueous 
suspension at ambient temperature by evaporation. Within one embodiment, water is 
removed through spray-drying (EP 520,748). Within the spray-drying process, the 
aqueous suspension is delivered into a flow of preheated gas, usually air. whereupon 

30 water rapidly evaporates from droplets of the suspension. Spray-drying apparatus are 
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available from a number of manufacturers {e.g., Drytec, Ltd.. Tonbridge, England; Lab- 
Plant, Ltd.. Huddersfield, England). Once dehydrated, the recombinant virus is stable 
and may be stored at -20°C to 25°C. Within the methods described herein, the resulting 
moisture content of the dried or lyophilized virus may be determined through use of a 
5 Karl-Fischer apparatus (EM Science Aquastar' VI B volumetric titrator, Cherry Hill, 
NJ), or through a gravimetric method. 

The aqueous solutions used for formulation, as previously described, are 
preferably composed of a saccharide, high molecular weight structural additive, a 
buffering component, and water. The solution may also include one or more amino 

10 acids. The combination of these components act to preserve the activity of the 
recombinant virus upon freezing and lyophilization or drying through evaporation. 
Although one saccharide that can be utilized is lactose, other saccharides may likewise 
be utilized including, for example, sucrose, mannitol, glucose, trehalose, inositol, 
fructose, maltose or galactose. In addition, combinations of saccharides can be used, for 

15 example, lactose and mannitol, or sucrose and mannitol. A particularly preferred 
concentration of lactose is 3%-4% by weight. Preferably, the concentration of the 
saccharide ranges from 1% to 12% by weight. 

The high molecular weight structural additive aids in preventing viral 
aggregation during freezing and provides structural support in the lyophilized or dried 

20 state. Within the context of the present invention, structural additives are considered to 
be of "high molecular weight" if they are greater than 5000 m.w. A preferred high 
molecular weight structural additive is human serum albumin. However, other 
substances may also be used, such as hydroxyethyl-cellulose, hydroxymethyl-cellulose, 
dextran, cellulose, gelatin, or povidone. A particularly preferred concentration of 

25 human serum albumin is 0.1% by weight. Preferably, the concentration of the high 
molecular weight structural additive ranges from 0. 1% to 10% by weight. 

The amino acids, if present, function to further preserve viral infectivity 
upon cooling and thawing of the aqueous suspension. In addition, amino acids function 
to further preserve viral infectivity during sublimation of the cooled aqueous suspension 

30 and while in the lyophilized state. A preferred amino acid is arginine, but other amino 
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acids such as lysine, ornithine, serine, glycine, glutamine, asparagine, glutamic acid or 
aspartic acid can also be used. A particularly preferred arginine concentration is 0.1% 
by weight. Preferably, the amino acid concentration ranges from 0.1% to 10% by 
weight. 

5 The buffering component acts to buffer the solution by maintaining a 

relatively constant pH. A variety of buffers may be used, depending on the pH range 
desired, preferably between 7.0 and 7.8. Suitable buffers include phosphate buffer and 
citrate buffer. A particularly preferred pH of the recombinant virus formulation is 7.4, 
and a preferred buffer is tromethamine. 
10 In addition, it is preferable that the aqueous solution contain a neutral 

salt which is used to adjust the final formulated recombinant alphavirus to an 
appropriate iso-osmotic salt concentration. Suitable neutral salts include sodium 
chloride, potassium chloride or magnesium chloride. A preferred salt is sodium 
chloride. 

15 Aqueous solutions containing the desired concentration of the 

components described above may be prepared as concentrated stock solutions. 

It will be evident to those skilled in the art, given the disclosure provided 
herein, that it may be preferable to utilize certain saccharides within the aqueous 
solution when the lyophilized virus is intended for storage at room temperature. More 

20 specifically, it is preferable to utilize disaccharides, such as lactose or trehalose, 
particularly for storage at room temperature. 

The lyophilized or dehydrated viruses of the subject invention may be 
reconstituted using a variety of substances, but are preferably reconstituted using water. 
In certain instances, dilute salt solutions which bring the final formulation to isotonicity 

25 may also be used. In addition, it may be advantageous to use aqueous solutions 
containing components known to enhance the activity of the reconstituted virus. Such 
components include cytokines, such as IL-2, polycations, such as protamine sulfate, or 
other components which enhance the transduction efficiency of the reconstituted virus. 
Lyophilized or dehydrated recombinant virus may be reconstituted with any convenient 
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volume of water or the reconstituting agents noted above that allow substantial, and 
preferably total solubilization of the lyophilized or dehydrated sample. 

I. Methods for Utilizing Gene Delivery Vehicles 
5 As noted above, the present invention also provides methods for 

delivering a selected heterologous sequence to a vertebrate (e.g., a mammal such as a 
human or other warm-blooded animal such as a horse, cow, pig, sheep, dog, cat, rat or 
mouse) or insect, comprising the step of administering to a vertebrate or insect a gene 
delivery vehicle as described herein which is capable of expressing the selected 
10 heterologous sequence. Such gene delivery vehicles may be administered either 
directly (e.g., intravenously, intramuscularly, intraperitoneally, subcutaneously, orally, 
rectally, intraocularly, intranasally), or by various physical methods such as lipofection 
(Feigner et aL, Proc. Natl. Acad. Sci. USA 54:7413-7417, 1989), direct DNA injection 
(Fung et aL, Proc. Natl. Acad. Sci. USA 50:353-357, 1983; Seeger et al., Proc. Natl. 

15 Acad. Sci. USA 57:5849-5852; Acsadi et aL, Nature JJ2:81 5-818, 1991); 
microprojectile bombardment (Williams et aL, PNAS 55:2726-2730, 1991); liposomes 
of several types (see, e.g., Wang et aL, PNAS 54:7851-7855, 1987); CaP0 4 (Dubensky 
et aL, PNAS 5/:7529-7533, 1984); DNA hgand (Wu et al, J. BioL Chem. 264:16985- 
16987, 1989); administration of nucleic acids alone (WO 90/1 1092); or administration 

20 of DNA linked to killed adenovirus (Curiel et aL, Hum. Gene Ther. J:147-154, 1992); 
via polycation compounds such as polylysine. utilizing receptor specific ligands: as well 
as with psoralen inactivated viruses such as Sendai or Adenovirus. In addition, the gene 
delivery vehicles may either be administered directly (i.e., in vivo), or to cells which 
have been removed (ex vivo), and subsequently returned. 

25 As discussed in more detail below, gene delivery vehicles may be 

administered to a vertebrate or insect for a wide variety of therapeutic and/or other 
productive purposes, including for example, for the purpose of stimulating a specific 
immune response; inhibiting the interaction of an agent with a host cell receptor; to 
express a toxic palliative, including for example, conditional toxic palliatives; to 

30 immunologically regulate the immune system; to prevent cell division, to express 
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markers, for replacement gene therapy, to promote wound healing and/or to produce a 
recombinant protein. These and other uses are discussed in more detail below. 

1. Tmmunostimulation 
5 Within one aspect of the present invention, compositions and methods 

are provided for administering a gene delivery vehicle which is capable of preventing, 
inhibiting, stabilizing or reversing infectious, cancerous, auto-immune or immune 
diseases. Representative examples of such diseases include viral infections such as 
HP/, HBV. HCV. HTLV L KTLV II. CMV, EBV and HPV, melanomas, diabetes, graft 
10 vs. host disease. Alzheimer's disease and heart disease. More specifically, within one 
aspect of the present invention, compositions and methods are provided for stimulating 
an immune response (either humoral or cell-mediated) to a pathogenic agent, such that 
the pathogenic agent is either killed or inhibited. Representative examples of 
pathogenic agents include bacteria, fungi, parasites, viruses and cancer cells. 
15 Within one embodiment of the invention the pathogenic agent is a virus, 

and methods are provided for stimulating a specific immune response and inhibiting 
viral spread by using a gene delivery vehicle that directs the expression of an antigen or 
modified form thereof to susceptible target cells capable of either (1) initiating an 
immune response to the viral antigen or (2) preventing the viral spread by occupying 
20 cellular receptors required for viral interactions. Expression of the vector nucleic acid 
encoded protein may be transient or stable with time. Where an immune response is to 
be stimulated to a pathogenic antigen, the gene delivery vehicle is preferably designed 
to express a modified form of the antigen which will stimulate an immune response and 
which has reduced pathogenicity relative to the native antigen. This immune response 
25 is achieved when cells present antigens in the correct manner, i.e., in the context of the 
MHC class I and/or II molecules along with accessory molecules such as CD3, ICAM- 
1, ICAM-2, LFA-1, or analogues thereof {e.g. , Altmann et al., Nature 338:512, 1989). 
Cells infected with gene delivery vehicles are expected to do this efficiently because 
they closely mimic genuine viral infection and because they: (a) are able to infect non- 
30 replicating cells, (b) do not integrate into the host cell genome, (c) are not associated 
with any life threatening diseases, and (d) express high levels of heterologous protein. 
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Because of these differences, gene delivery vehicles can easily be thought of as safe 
viral vectors which can be used on healthy individuals for vaccine use. 

This aspect of the invention has a further advantage over other systems 
that might be expected to function in a similar manner, in that the presenter cells are 
5 fully viable and healthy, and low levels of viral antigens, relative to heterologous genes, 
are expressed. This presents a distinct advantage since the antigenic epitopes expressed 
can be altered by selective cloning of sub-fragments of the gene for the antigen into the 
recombinant alphavirus. leading to responses against immunogenic epitopes which may 
otherwise be overshadowed by immunodominant epitopes. Such an approach may be 

10 extended to the expression of a peptide having multiple epitopes, one or more of the 
epitopes being derived from different proteins. Further, this aspect of the invention 
allows efficient stimulation of cyiotoxic T lymphocytes (CTL) directed against 
antigenic epitopes, and peptide fragments of antigens encoded by sub-fragments of 
genes, through intracellular synthesis and association of these peptide fragments with 

15 MHC Class I molecules. This approach may be utilized to map major 
immunodominant epitopes for CTL induction. 

An immune response may also be achieved by transferring to an 
appropriate immune ceil (such as a T lymphocyte) the gene for the specific T cell 
receptor which recognizes the antigen of interest (in the context of an appropriate MHC 

20 molecule if necessary), for an immunoglobulin which recognizes the antigen of interest, 
or for a hybrid of the two which provides a CTL response in the absence of the MHC 
context. Thus, the gene delivery vehicle cells may be used as an imrnunostimulant, 
immunomodulator, or vaccine. 

In another embodiment of the invention, methods are provided for 

25 producing inhibitor palliatives wherein gene delivery vehicles deliver and express 
defective interfering viral structural proteins, which inhibit viral assembly. Such gene 
delivery vehicles may encode defective gag, poi, env or other viral panicle proteins or 
peptides and these would inhibit in a dominant fashion the assembly of viral particles. 
This occurs because the interaction of normal subumts of the viral particle is disturbed 

30 by interaction with the defective subunits. 
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In another embodiment of the invention, methods are provided for the 
expression of inhibiting peptides or proteins specific for viral protease. Briefly, viral 
protease cieaves the viral gag and gag/pol proteins into a number of smaller peptides. 
Failure of this cleavage in all cases leads to complete inhibition of production of 
5 infectious retroviral particles. As an example, the HIV protease is known to be an 
aspartyl protease and these are known to be inhibited by peptides made from amino 
acids from protein or analogues. Gene delivery vehicles to inhibit HIV will express one 
or multiple fused copies of such peptide inhibitors. 

Another embodiment involves the delivery of suppressor genes which, 

10 when deleted, mutated, or not expressed in a cell type, lead to tumorigenesis in that cell 
type. Reintroduction of the deleted gene by means of a gene delivery vehicle leads to 
regression of the rumor phenotype in these cells. Examples of such cancers are 
retinoblastoma and Wilms Tumor. Since malignancy can be considered to be an 
inhibition of cellular terminal differentiation compared with cell growth, the alphavirus 

15 vector delivery and expression of gene products which lead to differentiation of a tumor 
should also, in general, lead to regression. 

In yet another embodiment, the gene delivery vehicle provides a 
therapeutic effect by transcribing a ribozyme (an RNA enzyme) (Haseloff and Gerlach, 
Nature 334:585, 1989) which will cleave and hence inactivate RNA molecules 

20 corresponding to a pathogenic function. Since ribozymes function by recognizing a 
specific sequence in the target RNA and this sequence is normally 12 to 17 bp, this 
allows specific recognition of a particular RNA species such as a RNA or a retroviral 
genome. Additional specificity may be achieved in some cases by making this a 
conditional toxic palliative (see below). 

25 One way of increasing the effectiveness of inhibitory palliatives is to 

express viral inhibitory genes in conjunction with the expression of genes which 
increase the probability of infection of the resistant cell by the virus in question. The 
result is a nonproductive "dead-end" event which would compete for productive 
infection events. In the specific case of HIV, gene delivery vehicles may be delivered 

30 which inhibit HIV replication (by expressing anti-sense tat, etc., as described above) 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



and also overexpress proteins required for infection, such as CD4. In this way, a 
relatively small number of vector-infected HIV-resistant ceils act as a "sink" or 
"magnet" for multiple nonproductive fusion events with free vims or viraily infected 
ceils. 

5 

2. Blocking Agents 

Many infectious diseases, cancers, autoimmune diseases, and other 
diseases involve the interaction of viral particles with cells, cells with cells, or cells with 
factors produced by themselves or other cells. In viral infections, viruses commonly 

10 enter cells via receptors on the surface of susceptible ceils. In cancers or other 
proliferative conditions {e.g., restenosis), cells may respond inappropriately or not at all 
to signals from other cells or factors, or specific factors may be mutated, overexpressed, 
or underexpressed. resulting in loss of appropriate cell cycle control. In autoimmune 
disease, there is inappropnate recognition of "self markers. Within the present 

1 5 invention, such interactions may be blocked by producing, in vivo, an analogue to either 
of the partners in an interaction. Alternatively, cell cycle control may be restored by 
preventing the transition from one phase to another {e.g., Gl to S phase) using a 
blocking factor which is absent or underexpressed. This blocking action may occur 
intracellularly, on the cell membrane, or extracellularly. and the action of an alphavirus 

20 vector carrying a gene for a blocking agent, can be mediated either from inside a 
susceptible cell or by secreting a version of the blocking protein to locally block the 
pathogenic interaction. 

In the case of HIV, the two agents of interaction are the gp 120/gp 41 
envelope protein and the CD4 receptor molecule. Thus, an appropriate blocker would 

25 be a gene delivery vehicle expressing either an HIV env analogue that blocks HIV entry 
without causing pathogenic effects, or a CD4 receptor analogue. The CD4 analogue 
would be secreted and would function to protect neighboring cells, while the gp 120/gp 
41 is secreted or produced only intracellularly so as to protect only the vector- 
containing cell. It may be advantageous to add human immunoglobulin heavy chains or 

30 other components to CD4 in order to enhance stability or complement lysis. 
Administration of a gene delivery vehicle encoding such a hybrid-soluble CD4 to a host 
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results in a continuous supply of a stable hybrid molecule. Efficacy of treatment can be 
assayed by measuring the usual indicators of disease progression, including antibody 
level, viral antigen production, infectious HIV levels, or levels of nonspecific 
infections. 

5 In the case of uncontrolled proliferative states, such as cancer or 

restenosis, cell cycle progression may be halted by the expression of a number of 
different factors that affect signaling by cyciins or cyclin-dependent kinases (CDK). 
For example, the cyclin-dependent kinase inhibitors, p 1 6, p2l, and p27 each regulate 
cyclin:CDK mediated cell cycle signaling. Overexpression of these factors within a cell 

10 by a gene delivery vehicle results in a cytostatic suppression of cell proliferation. Other 
factors that may be used therapeutically, as blocking agents or targets to disrupt cell 
proliferation, include, for example, wild-type or mutant Rb, p53, Myc, Fos. Jun, PCNA, 
GAX, lenti viral vpr and pi 5. Within related embodiments, cardiovascular diseases such 
as restenosis or atherosclerosis may be treated or prevented with vectors that express 

15 products which promote re-endothelialization, or vascular remodeling {e.g., VEGF, 
TFPI, SOD). . 

3. Ex pression of Palliatives 

Techniques similar to those described above can be used to produce gene 

20 deliver.' vehicles which direct the expression of an agent (or "palliative") which is 
capable of inhibiting a function of a pathogenic agent or gene. Within the present 
invention, "capable of inhibiting a function" means that the palliative either directly 
inhibits the function or indirectly does so, for example, by convening an agent present 
in the cells from one which would not normally inhibit a function of the pathogenic 

25 agent to one which does. Examples of such functions for viral diseases include 
adsorption, replication, gene expression, assembly, and exit of the vims from infected 
cells. Examples of such functions for a cancerous cell, cancer-promoting growth factor, 
or uncontrolled proliferative condition (e.g., restenosis) include viability, cell 
replication, altered susceptibility to external signals (e.g., contact inhibition), and lack 

30 of production or production of mutated forms of anti-oncogene proteins. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21Q62 



76 



a. Inhibitor Palliatives 

In one aspect of the present invention, the gene delivery vehicle directs 
the expression of a gene which can interfere with a function of a pathogenic agent, for 
5 instance in viral or malignant diseases. Such expression may either be essentially 
continuous or in response to the presence in the cell of another agent associated either 
with the pathogenic condition or with a specific cell type (an "identifying agent"). In 
addition, vector delivery may be controlled by targeting vector entry specifically to the 
desired cell type (for instance, a virally infected or malignant cell) as discussed above. 

10 One method of administration is ieukophoresis, in which about 20% of 

an individual's PBLs are removed at any one time and manipulated in vitro. Thus, 
approximately 2 x 10 g cells may be treated and replaced. Repeat treatments may also be 
performed. Alternatively, bone marrow may be treated and allowed to amplify the 
effect as described above. In addition, packaging cell lines producing a vector may be 

15 directly injected into a subject, allowing continuous production of recombinant virions. 

In one embodiment, gene delivery vehicles which express RNA 
complementary to key pathogenic gene transcripts (for example, a viral gene product or 
an activated cellular oncogene) can be used to inhibit translation of that transcript into 
protein, such as the inhibition of translation of the HIV tat protein. Since expression of 

20 this protein is essential for viral replication, cells containing the gene delivery vehicle 
would be resistant to HIV replication. 

In a second embodiment, where the pathogenic agent is a single-stranded 
virus having a packaging signal, RNA complementary to the viral packaging signal 
(e.g., an HIV packaging signal when the palliative is directed against HIV) is expressed, 

25 so that the association of these molecules with the viral packaging signal will, in the 
case of retroviruses, inhibit stem loop formation or tRNA primer binding required for 
proper encapsidation or replication of the alphavirus RNA genome. 

In a third embodiment, a gene delivery vehicle may be introduced which 
expresses a palliative capable of selectively inhibiting the expression of a pathogenic 

30 gene, or a palliative capable of inhibiting the activity of a protein produced by the 
pathogenic agent. In the case of HIV, one example is a mutant tat protein which lacks 
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the ability to transactivate expression from the HIV LTR and interferes (in a 
transdominant manner) with the normal functioning of tat protein. Such a mutant has 
been identified for HTLV II tat protein ("XII Leu 5n mutant; see Wachsman et al., 
Science 235:61 '4, 1987). A mutant transrepressor tat should inhibit replication much as 
5 has been shown for an analogous mutant repressor in HSV-1 (Friedmann et al., Nature 
335:452, 1988). 

Such a transcriptional repressor protein can be selected for in tissue 
culture using any viral-specific transcriptional promoter whose expression is stimulated 
by a virus-specific transactivating protein (as described above). In the specific case of 

10 HIV, a cell line expressing HIV tat protein and the HSVTK gene driven by the HIV 
promoter will die in the presence of ACV. However, if a series of mutated tat genes are 
introduced to the system, a mutant with the appropriate properties (i.e., represses 
transcription from the HIV promoter in the presence of wild-type tat) will grow and be 
selected. The mutant gene can then be reisolated from these cells. A cell line 

15 containing multiple copies of the conditionally lethal vector/tat system may be used to 
assure that surviving cell clones are not caused by endogenous mutations in these genes. 
A battery of randomly mutagenized tat genes are then introduced into these cells using a 
"rescuable" alphavirus vector (i.e., one that expresses the mutant tat protein and 
contains a bacterial origin of replication and drug resistance marker for growth and 

20 selection in bacteria). This allows a large number of random mutations to be evaluated 
and permits facile subsequent molecular cloning of the desired mutant cell line. This 
procedure may be used to identify and utilize mutations in a variety of viral 
transcriptional activator/ viral promoter systems for potential antiviral therapies. 

25 b. Conditional Toxic Palliatives 

Another approach for inhibiting a pathogenic agent is to express a 
palliative which is toxic for the cell expressing the pathogenic condition. In this case, 
expression of the palliative from the gene delivery vehicle should be limited by the 
presence of an entity associated with the pathogenic agent, such as a specific viral RNA 

30 sequence identifying the pathogenic state, in order to avoid destruction of 
nonpathogenic cells. 
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In one embodiment of this method, a gene delivery vehicle can be 
utilized to express a toxic gene (as discussed above) from a cell-specific responsive 
vector, in this manner, rapidly replicating ceils, which contain the RNA sequences 
capable of activating the cell-specific responsive vectors, are preferentially destroyed by 
5 the cytotoxic agent produced by the gene delivery vehicle. 

In a similar manner to the preceding embodiment, the gene delivery 
vehicle can carry a gene for phosphorylation, phosphoribosylation, ribosylation, or 
other metabolism of a purine- or pyrimidine-based drug. This gene may have no 
equivalent in mammalian cells and might come from organisms such as a virus, 

10 bacterium, fungus, or protozoan. An example of this would be the E. coli guanine 
phosphoribosyl transferase gene product, which is lethal in the presence of thioxanthine 
{see Besnard et aL Mol. Cell. Biol. 7:4139-4141, 1987). Conditionally lethal gene 
products of this type (also referred to as "prodrugs converting enzymes" above) have 
application to many presently known purine- or pyrimidine-based anticancer drugs, 

15 which often require intracellular ribosylation or phosphorylation in order to become 
effective cytotoxic agents. The conditionally lethal gene product could also metabolize 
a nontoxic drug which is not a purine or pyrimidine analogue to a cytotoxic form {see 
Searie et aL, Brit. J. Cancer 5J:377-384, 1986). 

Mammalian viruses in general tend to have "immediate early" genes 

20 which are necessary for subsequent transcriptional activation from other viral promoter 
elements. RNA sequences of this nature are excellent candidates for activating 
alphavirus vectors intracellular signals (or "identifying agents") of viral infection. 
Thus, conditionally lethal genes expressed from alphavirus cell-specific vectors 
responsive to these viral "immediate early" gene products could specifically kill cells 

25 infected with any particular virus. Additionally, since the human and interferon 
promoter elements are transcriptionally activated in response to infection by a wide 
variety of nonrelated viruses, the introduction of vectors expressing a conditionally 
lethal gene product like HSVTK, for example, in response to interferon production 
could result in the destruction of cells infected with a variety of different viruses. 
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In another aspect of the present invention, gene delivery vehicles are 
provided which direct the expression of a gene product capable of activating an 
otherwise inactive precursor into an active inhibitor of the pathogenic agent. For 
example, the HSVTK gene product may be used to more effectively metabolize poten- 
5 tially antiviral nucleoside analogues such as AZT or ddC. The HSVTK gene may be 
expressed under the control of a cell-specific responsive vector and introduced into 
these cell types. AZT (and other nucleoside antivirals) must be metabolized by cellular 
mechanisms to the nucleotide triphosphate form in order to specifically inhibit retroviral 
reverse transcriptase, and thus, HIV replication (Furmam et aL Proc. Natl. Acad. Sci. 

10 USA £.3:8333-8337. 1986). Constitutive expression of HSVTK (a nucleoside and 
nucleoside kinase with very broad substrate specificity) results in more effective 
metabolism of these drugs to their biologically active nucleotide triphosphate form. 
AZT or ddC therapy will thereby be more effective, allowing lower doses, less 
generalized toxicity, and higher potency against productive infection. Additional 

15 nucleoside analogues whose nucleotide triphosphate forms show selectivity for 
retroviral reverse transcriptase but. as a result of the substrate specificity of cellular 
nucleoside and nucleotide kinases are not phosphorylated, will be made more 
efficacious. 

Administration of these gene delivery vehicles to human T cell and 
20 macrophage/monocyte cell lines can increase their resistance to HIV in the presence of 
AZT and ddC compared to the same cells without retroviral vector treatment. 
Treatment with AZT would be at lower than normal levels to avoid toxic side effects 
but still efficiently inhibit the spread of HIV. The course of treatment would be as 
described for the blocker. 
25 In one embodiment, the gene delivery vehicle carries a gene specifying a 

product which is not in itself toxic but, when processed or modified by a protein such as 
a protease specific to a viral or other pathogen, is converted into a toxic form. For 
example, the gene delivery vehicle could carry a gene encoding a proprotein for ricin A 
chain, which becomes toxic upon processing by the HIV protease. More specifically, a 
30 synthetic inactive proprotein form of the toxin ricin or diphtheria A chains could be 
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cleaved to the active form by arranging for the HIV virally encoded protease to 
recognize and cleave off an appropriate "pro" element. 

In another embodiment, the gene delivery vehicle may express a 
"reporting product" on the surface of the target cells in response to the presence of an 
5 identifying agent in the cells (such as expression of a viral gene). This surface protein 
can be recognized by a cytotoxic agent, such as antibodies for the reporting protein, or 
by cytotoxic T cells. In a similar manner, such a system can be used as a detection 
system {see below) to simply identify those cells having a particular gene which 
expresses an identifying protein. 
10 Similarly, in another embodiment, a surface protein could be expressed 

which would itself be therapeutically beneficial. In the particular case of HIV. 
expression of the human CD4 protein specifically in HIV-infected ceils may be 
beneficial in two ways: 

1. Binding of CD4 to HIV env intracellular^ could inhibit the 
15 formation of viable viral panicles, much as soluble CD4 has been shown to do for free 

virus, but without the problem of systematic clearance and possible immunogenicity, 
since the protein will remain membrane bound and is structurally identical to 
endogenous CD4 (to which the patient should be immunologically tolerant). 

2. Since the CD4/HIV env complex has been implicated as a cause 
20 of cell death, additional expression of CD4 (in the presence of excess HIV-env present 

in HIV-infected cells) leads to more rapid cell death and thus inhibits viral 
dissemination. This may be particularly applicable to monocytes and macrophages, 
which act as a reservoir for vims production as a result of their relative retractility to 
HIV-induced cytotoxicity (which, in turn, is apparently due to the relative lack of CD4 

25 on their cell surfaces). 

In another embodiment, the gene delivery vehicle can provide a 
ribozyme which will cleave and inactivate RNA molecules essential for viability of the 
vector infected cell. By making ribozyme production dependent on a specific RNA 
sequence corresponding to the pathogenic state, such as HIV tat, toxicity is specific to 

30 the pathogenic state. 
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4. Expression of Markers 

The above-described technique of expressing a palliative in a cell in 
response to a specific RNA sequence can also be modified to enable detection of a 
5 particular gene in a cell which expresses an identifying protein (for example, a gene 
carried by a particular virus), and hence enable detection of cells carrying that virus. In 
addition, this technique enables the detection of viruses (such as HIV) in a clinical 
sample of cells carrying an identifying protein associated with the virus. 

This modification can be accomplished by providing a genome coding 

1 0 for a product, the presence of which can be readily identified (the "marker product"), in 
a gene delivery vehicle which responds to the presence of the identifying protein in the 
infected cells. For example, HIV, when it infects suitable cells, makes tat and rev. The 
indicator cells can thus be provided with a genome (such as by infection with an 
appropriate recombinant alphavirus) which codes for a marker gene, such as the alkaline 

15 phosphatase gene, [3-galactosidase gene, or the luciferase gene which is expressed by 
the recombinant alphavirus upon activation by the tat and/or rev RNA transcript. In the 
case of p-galactosidase or alkaline phosphatase, exposing the cells to substrate 
analogues results in a color or fluorescence change if the sample is positive for HIV. In 
the case of luciferase. exposing the sample to luciferin will result in luminescence if the 

20 sample is positive for HIV. For intracellular enzymes such as p-galactosidase, the viral 
titre can be measured directly by counting colored or fluorescent cells, or by making 
cell extracts and performing a suitable assay. For the membrane bond form of alkaline 
phosphatase, virus titre can also be measured by performing enzyme assays on the cell 
surface using a fluorescent substrate. For secreted enzymes, such as an engineered form 

25 of alkaline phosphatase, small samples of culture supernatant are assayed for activity, 
allowing continuous monitoring of a single culture over time. Thus, different forms of 
this marker system can be used for different purposes. These include counting active 
virus, or sensitively and simply measuring viral spread in a culture and the inhibition of 
this spread by various drugs. 

30 Further specificity can be incorporated into the preceding system by 

testing for the presence of the virus either with or without neutralizing antibodies to that 
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virus. For exampie : in one portion of the clinical sample being tested, neutralizing 
antibodies to HIV may be present; whereas in another portion there would be no 
neutralizing antibodies. If the tests were negative in the system where there were 
antibodies and positive where there were no antibodies, this would assist in confirming 
5 the presence of HIV. 

Within an analogous system for an in vitro assay, the presence of a 
particular gene, such as a viral gene, may be determined in a cell sample. In this case, 
the cells of the sample are infected with a suitable gene delivery vehicle which carries 
the reporter gene which is only expressed in the presence of the appropriate viral RNA 

10 transcript. The reporter gene, after entering the sample cells, will express its reporting 
product (such as (5-galactosidase or luciferase) only if the host cell expresses the 
appropriate viral proteins. 

These assays are more rapid and sensitive, since the reporter gene can 
express a greater amount of reporting product than identifying agent present, which 

1 5 results in an amplification effect. 

5. Immune Down-Regulation 

As described above, the present invention also provides gene delivery 
vehicles capable of suppressing one or more elements of the immune system in target 

20 cells infected with the alphavirus. Briefly, specific down-regulation of inappropriate or 
unwanted immune responses, such as in chronic hepatitis or in transplants of 
heterologous tissue such as bone marrow, can be engineered using immune-suppressive 
viral gene products which suppress surface expression of transplantation (MHC) 
antigen. Group C adenoviruses Ad2 and Ad5 possess a 19 kd glycoprotein (gp 19) 

25 encoded in the E3 region of the vims. This gp 19 molecule binds to class I MHC 
molecules in the endoplasmic reticulum of cells, and prevents terminal glycosylation 
and translocation of class I MHC to the cell surface. For example, prior to bone marrow 
transplantation, donor bone marrow cells may be infected with a gp 19-encoding gene 
delivery vehicle which, upon expression of the gp 19, inhibit the surface expression of 

30 MHC class I transplantation antigens. These donor cells may be transplanted with low 
risk of graft rejection and may require a minimal immunosuppressive regimen for the 
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transplant patient. This may allow an acceptable donor-recipient chimeric state to exist 
with fewer complications. Similar treatments may be used to treat the range of so- 
called autoimmune diseases, including lupus erythromiatis, multiple sclerosis, 
rheumatoid arthritis or chronic hepatitis B infection. 
5 An alternative method involves the use of anti-sense message, ribozyme, 

or other specific gene expression inhibitor specific for T cell clones which are 
autoreactive in nature. These block the expression of the T cell receptor of particular 
unwanted clones responsible for an autoimmune response. The anti-sense, ribozyme, or 
other gene may be introduced using the viral vector delivery system. 

10 

6. Replacement or Augmentation Gene Therapy 

One further aspect of the present invention relates to transforming cells 
of a vertebrate or insect with a gene delivery vehicle which supplies genetic sequences 
capable of expressing a therapeutic protein. Within one embodiment of the present 

15 invention, the gene delivery vehicle is designed to express a therapeutic protein capable 
of preventing, inhibiting, stabilizing or reversing an inherited or nomnherited genetic 
defect in metabolism, immune regulation, hormonal regulation, enzymatic or membrane 
associated structural function. This embodiment also describes the gene delivery 
vehicle capable of transducing individual cells, whereby the therapeutic protein is able 

20 to be expressed systemically or locally from a specific cell or tissue, whereby the 
therapeutic protein is capable of (a) the replacement of an absent or defective cellular 
protein or enzyme, or (b) supplement production of a defective of low expressed 
cellular protein or enzyme. Such diseases may include cystic fibrosis, Parkinson's 
disease, hypercholesterolemia, adenosine deaminase deficiency. C-globin disorders, 

25 Hemophilia A & B, Gaucher's disease, diabetes and leukemia. 

As an example of the present invention, a gene delivery vehicle can be 
constructed and utilized to treat Gaucher disease. Briefly, Gaucher disease is a genetic 
disorder that is characterized by the deficiency of the enzyme glucocerebrosidase. This 
type of therapy is an example of a single gene replacement therapy by providing a 

30 functional cellular enzyme. This enzyme deficiency leads to the accumulation of 
glucocerebroside in the lysosomes of all cells in the body. However, the disease 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 PO7US98/21062 

84 

phenotype is manifested only in the macrophages, except in the very rare neuronpathic 
forms of the disease. The disease usually leads to enlargement of the liver and spleen 
and lesions in the bones. (For a review, see Science 25(5:794, 1992. and The Metabolic 
Basis of Inherited Disease, 6th ed.. Scriver et al., vol. 2, p. 1677). 
5 Gene delivery vehicles can similarly be utilized to deliver a wide variety 

of therapeutic proteins in order to treat, cure, prevent a disease or disease process. 
Representative examples of such genes include, but are not limited to, insulin (see U.S. 
4,431,740 and BE 8S5196A), hemoglobin (Lawn et al., Cell 27:647-51, 1980), 
erythropoietin (EPO: see U.S. 4,703,008), megakaryocyte growth and differentiation 

10 factor (MGDF), stem cell factor (SCF), G-CSF (Nagata et al., Xaiure J79:415-41S, 
19S6). GM-CSF. M-CSF (see WO 8706954), the fit3 ligand (Lyman et al. (1993), Cell 
75:1 157-1167), EGF, acidic and basic FGF, PDGF, members of the interleukin or 
interferon families, supra, neurotropic factors {e.g., BDNF: Rosenthal et al., 
Endocrinology 729:1289-1294, 1991. NT-3; see WO 9103569, CNTF; see WO 

15 9104316, NGF; see WO 9310150), coagulation factors (e.g., factors VIII and IX), 
thrombolytic factors such as t-PA (see EP 292009, AU S653302 and EP 174835) and 
streptokinase (see EP 407942), human growth hormone (see TP 94030582 and U.S. 
4,745,069) and other animal somatotropins, integrins and other cell adhesion molecules, 
such as ICAiM-1 and ELAM (see also other "heterologous sequences" discussed above), 

20 and other growth factors, such as IGF-I and IGF-II, TGF-p, osteogenic protein- 1 
(Ozkaynak et al.. EMBO J. 9:2085-2093, 1990), and other bone morphogenetic proteins 
(e.g., BMP-4, Nakase et al. J. Bone Miner. Res. P:651-659. 1994). 

7. Lymphokines and Lvmphokine Receptors 

25 As noted above, the present invention also provides gene delivery 

vehicles which can. among other functions, direct the expression of one or more 
cytokines or cytokine receptors. Briefly, in addition to their role as cancer therapeutics, 
cytokines can have negative effects resulting in certain pathological conditions. For 
example, most resting T-cells, B cells, large granular lymphocytes and monocytes do 

30 not express 1L-2R (receptor). In contrast to the lack of IL-2R expression on normal 
resting cells, IL-2R is expressed by abnormal cells in patients with certain leukemias 
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(ATL, Hairy-cell. Hodgkins. acute and chronic granulocytic), autoimmune diseases, and 
is associated with allograft rejection. Interestingly, in most of these patients the serum 
concentration of a soluble form of IL-2R is elevated. Therefore, with certain 
embodiments of the invention therapy may be effected by increasing the serum 
5 concentration of the soluble form of the cytokine receptor. For example, in the case of 
IL-2R. a gene delivery vehicle can be engineered to produce both soluble IL-2R and IL- 
2R, creating a high affinity soluble receptor. In this configuration, serum IL-2 levels 
would decrease, inhibiting the paracrine loop. This same strategy also may be effective 
against autoimmune diseases. In particular, because some autoimmune diseases {e.g., 

10 Rheumatoid arthritis. SLE) also are associated with abnormal expression of IL-2, 
blocking the action of IL-2 by increasing the serum level of receptor may also be 
utilized in order to treat such autoimmune diseases. 

In other cases inhibiting the levels of IL-1 may be beneficial. Briefly, 
IL-1 consists of two polypeptides, IL-1 and IL-1, each of which has plieotropic effects. 

15 IL-1 is primarily synthesized by mononuclear phagocytes, in response to stimulation by 
microbial products or inflammation. There is a naturally occurring antagonist of the 
IL-1R, referred to as the IL-1 Receptor antagonist ("IL-lRa"). This IL-1R antagonist 
has the same molecular size as mature IL-1 and is structurally related to it. However, 
binding of IL-lRa to the IL-1R does not initiate any receptor signaling. Thus, this 

20 molecule has a different mechanism of action than a soluble receptor, which complexes 
with the cytokine and thus prevents interaction with the receptor. IL-1 does not seem to 
play an important role in normal homeostasis. In animals, antibodies to IL-1 receptors 
reduce inflammation and anorexia due to endotoxins and other inflammation inducing 
agents. 

25 In the case of septic shock, IL-1 induces secondary compounds which 

are potent vasodilators. In animals, exogenously supplied IL-1 decreases mean arterial 
pressure and induces leukopenia. Neutralizing antibody to IL-1 reduced endotoxin- 
induced fever in animals. In a study of patients with septic shock who were treated with 
a constant infusion of IL-1 R for three days, the 28 day mortality was 16% compared to 

30 44% in patients who received placebo infusions. In the case of autoimmune disease, 
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reducing the activity of IL-1 reduces inflammation. Similarly, blocking the activity of 
IL-1 with recombinant receptors can result in increased allograft survival in animals, 
again presumably by decreasing inflammation. 

These diseases provide further examples where gene delivery vehicles 
5 may be engineered to produce a soluble receptor or more specifically the IL-IRa 
moiecule. For example, in patients undergoing septic shock, a single injection of IL- 
IRa producing vector particles could replace the current approach requiring a constant 
infusion of recombinant IL-1R. 

Cytokine responses, or more specifically, incorrect cytokine responses 

10 may also be involved in the failure to control or resolve infectious diseases. Perhaps the 
best studied example is non-healing forms of leishmaniasis in mice and humans which 
have strong, but counterproductive T H 2-dominated responses. Similarly, 
lepromotomatous leprosy is associated with a dominant, but inappropriate T H 2 response, 
in these conditions, gene delivery vehicles may be useful for increasing circulating 

15 levels of IFN gamma, as opposed to the site-directed approach proposed for solid tumor 
therapy. IFN gamma is produced by T H -1 T-cells, and functions as a negative regulator 
of T H -2 subtype proliferation. IFN gamma also antagonizes many of the IL-4 mediated 
effects on B-cells, including isotype switching to IgE. 

IgE, mast cells and eosinophils are involved in mediating allergic 

20 reaction. IL-4 acts on differentiating T-cells to stimulate T H -2 development, while 
inhibiting T H -1 responses. Thus, alphavirus-based gene therapy may also be 
accomplished in conjunction with traditional allergy therapeutics. One possibility is to 
deliver a gene delivery vehicle which produces IL4R with small amounts of the 
offending allergen (i.e., traditional allergy shots). Soluble IL-4R would prevent the 

25 activity of IL-4, and thus prevent the induction of a strong T H -2 response. 

8. Suicide Vectors 

One further aspect of the present invention relates to the use of gene 
delivery vehicle suicide vectors to limit the spread of wild-type alphavirus in the 
30 packaging/producer cell lines. Briefly, within one embodiment the gene delivery 
vehicle is comprised of an antisense or ribozyme sequence specific for the wild-type 
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alphavirus sequence generated from an RNA recombination event between the 3' 
sequences of the junction region of the vector, and the 5' alphavirus structural sequences 
of the packaging cell line expression vector. The antisense or ribozyme molecule would 
only be thermostable in the presence of the specific recombination sequence and would 
5 not have any other effect in the alphavirus packaging/producer cell line. Alternatively, 
a toxic molecule (such as those disclosed herein), may also be expressed in the context 
of a vector that would only express in the presence of wild-type alphavirus. 

9. Gene Delivery Vehicles to Prevent the Spread of Metastatic Tumors 

10 One further aspect of the present invention relates to the use of gene 

delivery vehicles for inhibiting or reducing the invasiveness of malignant neoplasms. 
Briefly, the extent of malignancy typically relates to vascularization of the tumor. One 
cause for tumor vascularization is the production of soluble tumor angiogenesis factors 
(TAF) (Paweletz et al.. Crit. Rev. Oncol. Hematoi P: 197, 1989) expressed by some 

15 tumors. Within one aspect of the present invention, tumor vascularization may be 
slowed utilizing gene delivery vehicles to express antisense or ribozyme RNA 
molecules specific for TAF. Alternatively, anti-angiogenesis factors (Moses et al., 
Science 245:1408, 1990; Shapiro et ah, PNAS 84:221%, 1987) may be expressed either 
alone or in combination with the above-described ribozymes or antisense sequences in 

20 order to slow or inhibit tumor vascularization. Alternatively, gene delivery vehicles can 
also be used to express an antibody specific for the TAF receptors on surrounding 
tissues. 

10. Administration of Gene Delivery Vehicles 

25 Within other aspects of the present invention, methods are provided for 

administering a gene delivery vehicle to a vertebrate or insect. Briefly, the final mode 
of gene delivery vehicle administration usually relies on the specific therapeutic 
application, the best mode of increasing vector potency, and the most convenient route 
of administration. Generally, this embodiment includes gene delivery vehicles which 

30 can be designed to be delivered by, for example, (1) direct injection into the blood 
stream; (2) direct injection into a specific tissue or tumor; (3) oral administration; 
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(4) nasal inhalation; (5) direct application to mucosal tissues; or (6) ex vivo 
administration of transduced autologous cells into the vertebrate or insect. Within 
certain embodiments of the invention, for ex vivo applications cells can be first removed 
from a host, positively and/or negatively selected in order to yield a population of cells 
5 which is at least partially purified {e.g., CD34* stem cells, T ceils, or the like), 
transduced, transfected, or, infected with one of the gene delivery vehicles of the 
present invention, and reintroduced into either the same host or another individual. 

Thus, the therapeutic gene delivery vehicle can be administered in such a 
fashion such that the vector can (a) transduce a normal healthy cell and transform the 
10 cell into a producer of a therapeutic protein or agent which is secreted systemicaily or 
locally, (b) transform an abnormal or defective cell, transforming the cell into a normal 
functioning phenotype, (c) transform an abnormal cell so that it is destroyed, and/or 
(d) transduce cells to manipulate the immune response. 

1 1. Modulation of Transcription Factor Activity 

In yet another embodiment, gene delivery vehicles may be utilized in 
order to regulate the growth control activity of transcription factors in the infected cell. 
Briefly, transcription factors directly influence the pattern of gene expression through 
sequence-specific /rci/is-activation or repression (Karin, New Biologist 27:126-131. 
1990). Thus, it is not surprising that mutated transcription factors represent a family of 
oncogenes. Gene delivery vehicles can be used, for example, to return control to tumor 
cells whose unregulated growth is activated by oncogenic transcription factors, and 
proteins which promote or inhibit the binding cooperatively in the formation of homo- 
and heterodimer /rans-activating or repressing transcription factor complexes. 

One method for reversing cell proliferation would be to inhibit the 
rra/w-activating potential of the c-mvc/Max heterodimer transcription factor complex. 
Briefly, the nuclear oncogene c-myc is expressed by proliferating cells and can be 
activated by several distinct mechanisms, including retroviral insertion, amplification, 
and chromosomal translocation. The Max protein is expressed in quiescent cells and, 
independently of c-myc, either alone or in conjunction with an unidentified factor. 



20 
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functions to repress expression of the same genes activated by the myc/Max 
heterodimer (Cole, Cell 55:715-716, 1991). 

Inhibition of c-myc or omvc/Max proliferation of rumor cells may be 
accomplished by the overexpression of Max in target cells controlled by gene delivery 
5 vehicles. The Max protein is only 160 amino acids (corresponding to 480 nucleotide 
RNA length) and is easily incorporated into a gene delivery vehicle either 
independently, or in combination with other genes and/or antisense/nbozyme moieties 
targeted to factors which release growth control of the cell. 

Modulation of homo/hetero-complex association is another approach to 

10 control transcription factor activated gene expression. For example ; transport from the 
cytoplasm to the nucleus of the irans- activating transcription factor NF-B is prevented 
while in a heterodimer complex with the inhibitor protein IB. Upon induction by a 
variety of agents, including certain cytokines, IB becomes phosphorylated and NF-B is 
released and transported to the nucleus, where it can exert its sequence-specific 

15 /raws-activating function (Baeuerie and Baltimore, Science 242:540-546, 1988). The 
dissociation of the NF-B/IB complex can be prevented by masking with an antibody the 
phosphorylation site of IB. This approach would effectively inhibit the rra/i5-activation 
activity of the NF-IB transcription factor by preventing its transport to the nucleus. 
Expression of the IB phosphorylation site specific antibody or protein in target cells 

20 may be accomplished with an alphavirus gene transfer vector. An approach similar to 
the one described here could be used to prevent the formation of the rra/75-activating 
transcription heterodimer factor AP-1 (Turner and Tijan, Science 243: 1689- 1694, 
1989), by inhibiting the association between the jun and fos proteins. 

25 12. Production of Recombinant Proteins 

In another aspect of the present invention, togavirus (including 

alphavirus) gene delivery vehicles can be utilized to direct the expression of one or 

more recombinant proteins in eukaryotic cells (ex vivo, in vivo, or established cell lines). 

As used herein, a "recombinant protein" refers to a protein, polypeptide, enzyme, or 
30 fragment thereof. Using this approach, proteins having therapeutic or other commercial 

application can be more cost-effectively produced. Furthermore, proteins produced in 
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eukaryotic cells may be more authentically modified post-translationally (e.g., 
glycosylated, sulfated, acetylated. etc.), as compared to proteins produced in prokaryotic 
cells. In addition, such systems may be employed in the in vivo production of various 
chemical compounds, e.g., fine or specialty chemicals. 
5 Within this aspect, the gene delivery vehicle encoding the desired 

protein, enzyme, or enzymatic pathway (as may be required for the production of a 
desired chemical) is transformed, transfected. transduced or otherwise introduced into a 
suitable eukaryotic ceil. Representative examples of proteins which can be produced 
using such a system include, but are not limited to, insulin (see U.S. 4,431,740 and BE 

10 885 196 A), hemoglobin (Lawn et al.. Cell 21:641-51, 1980), erythropoietin (EPO; see 
U.S. 4,703,008), megakaryocyte growth and differentiation factor (MGDF), stem cell 
factor (SCF), G-CSF (Nagata et al., Nature i/9:4I5-418, 1986), GM-CSF, M-CSF (see 
WO 8706954), the flt3 ligand (Lyman et al. (1993), Cell 75:1157-1 167), EGF, acidic 
and basic FGF, PDGF, members of the interleukin or interferon families, supra, 

15 neurotropic factors (e.g., BDNF; Rosenthal et al., Endocrinology J 29: 1289-1294, 1991, 
NT-3; see WO 9103569, CNTF; see WO 9104316, NGF; see WO 9310150), 
coagulation factors (e.g., factors VIII and IX), thrombolytic factors such as t-PA (see EP 
292009, AU 8653302 and EP 174835) and streptokinase (see EP 407942), human 
growth hormone (see JP 94030582 and U.S. 4,745,069) and other animal 

20 somatotropins, integrins and other cell adhesion molecules, such as ICAM-1 and ELAM 
(see also other "heterologous sequences" discussed above), and other growth factors, 
such as VEGF, IGF-I and IGF-II, TGF-J3, osteogenic protein- 1 (Ozkaynak et a!., EMBO 
J. 9:2085-2093, 1990), and other bone or cartilage morphogenetic proteins (e.g., BMP- 
4, Nakase et al, J. Bone Miner. Res. 9:651-659, 1994). As those in the art will 

25 appreciate, once characterized, any gene can be readily cloned into gene delivery 
vehicles according to the present invention, followed by introduction into a suitable host 
cell and expression of the desired gene. In addition, such vectors may be delivered 
directly in vivo, either locally or systemically to promote the desired therapeutic effect 
(e.g., wound healing applications). 
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Methods for producing recombinant proteins using the vectors and 
alphavirus packaging cell lines described herein are provided (see examples 6 and 7). 
Briefly, gene delivery vehicles, in the form of in vitro transcribed RNA, plasmid DNA, 
or recombinant vector particles, which encode recombinant proteins, may be introduced 
5 (via transfection or infection) into alphavirus packaging cell lines (PCLs) such that only 
a small fraction of the cultured cells (<1%) contain vector molecules. Vector replicons 
are packaged by the sPs, supplied in trans by the PCL, following vector RNA 
amplification, which proceeds according to the Sindbis virus replication strategy. In 
turn, the produced recombinant vector panicles infect the remaining cells of the culture. 
10 Thus, a bloom of recombinant protein expression results over time as recombinant 
vector particles are produced and subsequently infect all cells in the PCL culture. 
Similarly, amplification of vector panicles with PCL may be used to generate large, 
high titer panicle stocks for other applications. In yet another aspect of this invention, 
recombinant protein expression from producer cell lines is described (see Example 7). 
15 Briefly, cell lines are derived which contain all of the genetic elements, including vector 
replicon and defective helper expression cassettes, from which the production of vector 
panicles can be induced, via addition of an extracellular stimulus to the culture. Thus, 
expression of vector-encoded recombinant protein occurs as a result of induction of 
alphavirus vector panicle producer cell lines. In yet a still funher aspect of this 
20 invention, recombinant protein expression from cell lines stably transformed with 
eukaryotic layered vector initiation systems are described (see Example 7). Briefly, cell 
lines are derived which are stably transformed with an inducible eukaryotic iayered 
vector initiation system cassette that encodes a recombinant protein of interest. Thus, 
expression of vector-encoded recombinant protein occurs as a result of induction of the 
25 eukaryotic layered vector initiation system cassette. 

As should be readily understood given the disclosure provided herein, 
protein production utilizing RNA vectors replicons, eukaryotic layered vector initiation 
systems, or recombinant vector panicles may also be accomplished by methods other 
than introduction into packaging or producer cell lines. For example, such vectors may 
30 be introduced into a wide variety of other eukaryotic host cell lines (e.g., COS, BHK. 
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CHO, 293. or HeLa cells), as well as direct administration in vivo or to ex vivo cells, in 
order to produce the desired protein. 



5 J. Deposit Information 

The following materials have been deposited with the American Type 
Culture Collection: 



Deposit Designation Deposit Date Accession 

No. 

Wild type Sindbis virus CMCC £4639 April 2. 1 996 VR-2526 

SIN-1 Sindbis virus CMCC #4640 April 2, 1996 VR-2527 

pBG-SINl ELVS1.5 SEAP CMCC £4641 April 2, 1996 97502 



0 The above materials were deposited by Chiron Corporation with the 

American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville. 
Maryland under the terms of the Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for purposes of Patent Procedure. The accession 
number is available from the ATCC at telephone number f 30 1 ) 881-2600. 

5 These deposits are provided as convenience to those of skill in the art, 

and are not an admission that a deposit is required under 35 U.S.C. § 112. The nucleic 
acid sequence of these deposits, as well as the amino acid sequence of the polypeptides 
encoded thereby, are incorporated herein by reference and should be referred to in the 
event of an error in the sequence described therein. A license may be required to make, 

0 use, or sell the deposited materials, and no such license is granted hereby. 

The following examples are included to more fully illustrate the present 
invention. Additionally, these examples provide preferred embodiments of the 
invention and are not meant to limit the scope thereof. Standard methods for many of 
5 the procedures described in the following examples, or suitable alternative procedures, 
are provided in widely reorganized manuals of molecular biology, such as, for example, 
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"Molecular Cloning," Second Edition (Sambrook et al., Cold Spring Harbor Laboratory 
Press, 1987) and "Current Protocols in Molecular Biology" (Ausubel et al.. eds. Greene 
Associates/Wiley Interscience. NY, 1990). 
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EXAMPLES 

EXAMPLE 1 
Isolation and Characterization of Sin 1 

5 

Below, the identification and molecular characterization of a positive 
strand RNA virus which exhibits reduced inhibition of host macromolecular synthesis 
and is capable of establishing persistent infection in vertebrate cells, as compared to 
lytic, cytopathogenic wild type strains of the same virus, is described. For example, 
10 Sindbis virus is used as a prototype representative of the Alphavirus genus. 

A. Isolation, plaque purification, and characterization of SIN-1 from a wild-tvpe 
Sindbis v irus stock 

The isolation, molecular cloning, and characterization of a Sindbis virus 
15 variant strain is described. This strain is able to establish productive persistent infection 
in the absence of cytopathicity, but produce levels of virus equivalent to that of 
wild-type virus. 

A high-titered (>10 PFU/ml) wild-type stock obtained by infection of 
BHK ceils (ATCC No. CCL-10) with Sindbis virus (CMCC #4639) at low MOI (< 0.1). 

20 To facilitate infection, the virus inoculum was contained in a volume just sufficient to 
cover the monolayer when added to the cells. BHK cells were maintained, and all virus 
dilutions were performed, in Eagle minimal essential medium supplemented with 10% 
fetal calf serum. Cells were cultured at 37°C in a 5% CO, atmosphere. Extensive CPE, 
as demonstrated by "rounding up", loss of adhesion, and increased light refraction of 

25 individual cells within the monolayer, and additionally, the decreased overall cell 
density of the monolayer, was observed within 48 hours post infection (hpi). The cell 
culture fluids were collected, cell debris was removed by low speed centrifugation 
(4,000 rpm for 10 min at room temperature), and the virus stock was aliquoted and 
stored at -70°C. The titer of the Sindbis virus stock was determined by plaque assay as 

30 described previously (Strauss et al., Virology 74:154-168, 1976). Briefly, chicken 
embryo fibroblasts (CEF) monolayers were infected with various dilutions of the virus 
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stock and the monolayer overiayed with media supplemented with 0.75% agarose. At 
24-48 hpi plaques due to cell lysis were visualized and quantitated either directly or. 
alternatively, by staining with crystal violet after removing the agarose overlay. The 
virus titer was determined from samples infected with virus dilutions in which the 
5 plaques were accurately quantitated. 

A Sindbis virus stock enriched for DI panicles was obtained by repeated 
high MOI passage {> 5) on BHK cells. BHK monolayers were infected initially with 
the Sindbis virus seed stock at an MOI = 5. The culture medium was collected and 
clarified by iow speed centrifugation after complete cell lysis of the infected culture was 

10 observed (usually within 24 hpi). The clarified medium collected from the infected 
culture was then used to infect a fresh BHK monolayer. For example, 2 ml of the virus 
inoculum was added to a fresh BHK monolayer in a 10 cm petri dish. At 1 hpi, S ml of 
fresh medium was added to the virus-infected culture. As described above, the culture 
medium was collected and clarified after observation of complete cell lysis of the 

] 5 culture. This process was repeated until the rate at which cytopathogenicity in the BHK 
monolayer developed after infection was delayed until at least 4 days. The delay in the 
onset of cyropathogenicity after infection signifies the presence of a high level of DI 
panicles in the virus preparation. 

The presence of a high level of DI particles in the virus preparation 

20 derived from multiple serial undiluted passages of infected cell medium was determined 
by an interference assay or by RNA analysis of BHK infected cells. In the first method, 
a homologous interference assay was performed as a measure of the presence of DI 
particles. Briefly, BHK cells were infected alone at an MOI = 10 with the high-titered 
(>10 PFU/ml) wild-type stock prepared as described above At 16 hpi, the virus yield 

25 was determined by plaque assay, as described above. The virus yield from this 

9 10 . 

experiment was typically 1x10 to 1 x 10 PFU/ml. In another experimental group, 
BHK cells were simultaneously coinfected with the wild-type stock (MOI = 10) and the 
virus stock prepared from multiple serial undiluted passages of infected cell medium 
(MOI = 5). As before, the virus yield was determined at 16 hpi by plaque assay. If the 
30 virus stock from the second experimental group contains a high level of DI particles, the 
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virus yield will be at least 2-3 orders of magnitude lower than the First experiment (e.g.. 

< 1 x 10 ? PFU/ml). 

As a more definitive method for demonstrating the presence of DI 
particles in the virus preparation, virus-specific RNA in BHK infected cells at 16 hpi 
5 was analyzed. Briefly, BHK cells were infected (MOI = 10) with the high-titered (>10 
PFU/ml) wild-type stock or with the virus stock containing DI particles. Mock-infected 
controls and infected cells were treated with dactinomycin (1 mg/ml) and labeled with 

[ J H]uridine (20 mCi/ml) from 1 to 16 hpi. RNA was isolated from infected and control 
cells by using RNAzol B, as described by the manufacturer (Tel-Test, Inc., 

10 Fnendswood. Texas). Alternatively, RNA was isolated with Tri-Reagent (Molecular 
Research Center, Inc.. Cincinnati, Ohio), or by conventional methods using phenol 
extraction of cells lysed in a buffer (0.05 M Tris, 0.1 M NaCl, 0.001 M EDTA, pH 7.5) 
containing 0.5% Triton and 0.5% recrystallized naphthalene disulfonate, as described 
by Weiss et al. (J. Virol 14\\ 189-1 198, 1974). The RNAs were denatured with giyoxal 

15 and electrophoresed through 1.1% horizontal agarose gels prepared in 0.01 M sodium 
phosphate buffer (pH 7.0), at 5V/cm (McMaster and Carmichael, Proc. Nail. Acad. Sci. 
USA 74:4835-4838, 1977). Alternatively, RNA can be electrophoresed through 
formaldehyde gels. Following electrophoresis, all moisture was removed from the gels 
under vacuum with a gel dryer, and the dried gels were treated for fluorography and 

20 exposed to film. Two RNAs, corresponding to the genomic and subgenomic species 
(42S and 26S, respectively), were observed in samples from BHK cells infected with 
the wild-type virus stock. In contrast, a large number of RNA species that are distinct 
from the standard viral 42S and 26S RNAs were observed in samples from BHK cells 
infected with the virus stock containing DI particles. Multiple RNAs corresponding to 

25 DI RNAs migrated predominantly at molecular weights smaller than the 26S RNA 
species from wild-type virus. An example of multiple RNAs in addition to the 42S and 
26S species observed in BHK cells infected with a virus stock containing DI particles 
may be seen in Figure 3, lane 3, of Weiss et al. (J. Virol. JJ:463-474 ; 1980). 

A Sindbis virus variant strain which is able to establish productive 

30 persistent infection with decreased cytopathogenicity was isolated and molecularly 
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cloned from a virus stock enriched for DI panicles. BHK cells were infected at high 
multiplicity (MOI = 5) with the Sindbis vims stock enriched for DI panicles. 
Cytopathogenicity developed slowly compared to infection of BHK cells with wild-type 
virus; however, most cells were eventually lysed and detached from the plate. Cell 
5 debris and non-adherent cells were removed every two days by medium changes. 
Within two weeks after initial infection, separate and distinct colonies were observed. 
These colonies were thriving and demonstrated no morphological evidence of CPE, 
compared to uninfected BHK cell controls. Within 3-4 weeks, the cell colonies were 
large and discernible to the naked eye. The colonies were isolated with cloning nngs, 

10 and the cells were dispersed with either 3 mM EDTA or trypsin. Dispersed cells from' 
each colony were replated without dilution. Thereafter, cells were subcultured at a 1:10 
dilution upon reaching confluency, generally within four days. Aliquots of cells, 
designated BHK(SIN-I), were prepared in cryotubes after the fifth passage for long 
term storage in liquid nitrogen. BHK(SrN-l) cells were indistinguishable from the 

1 5 original, uninfected BHK cells in terms of growth rate or morphology. 

B. Molecular Cloning of STN-1 

To characterize the mutation(s) in the Sindbis genome which correlate 
with the development of the substantially reduced cytopathogenicity of SFN-1, genomic 
20 RNA from SIN-1 virions was isolated, reverse transcribed, and the resultant cDNA 
encompassing the nonstructural protein genes sequenced, as described more fully 
below. 

Briefly, the SIN-1 virus was plaque purified three times before 
preparation of a stock that was used for the isolation of RNA. The BHK(SIN-l) cells 

25 were grown as described above, the culture fluid was collected, and various dilutions 
were used to infect primary chicken embryo fibroblast monolayers (CEF, grown in 
minimal essential medium supplemented with 3% fetal calf serum). Following 
infection, medium containing 0.5% noble agar was added to the monolayers. 
Additionally, DEAE-dextran (100 ug/mL) can be included in the agar-overlay medium 

30 to increase the size of the SIN-1 plaques. Individual discreet plaques were observed 
after 3 days of incubation at 30°C in plates infected with suitably dilute inoculums of 
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the BHK(SIN-l) culture fluid. After the third round of purification, the cloned SrN-1 
virus was passaged once at 30°C in CEF cells infected at a MOI = 0.1. The plaque- 
purified SIN-1 virus preparations were determined to be free of DI panicles by the 
interference assay and RNA analysis in BHK infected cells, as described above. 
5 BHK cells were infected (MOI = 10) with the plaque purified SIN-1 

stock to determine the ability of this wild-type Sindbis virus variant strain to establish 
persistence. As described above, establishment of persistent infection in BHK cells 
with wild-type Sindbis virus requires the presence of DI panicles in the virus 
preparation and considerable time to allow those few surviving cells to grow out. In 

10 contrast, persistent infection was readily established in BHK cells infected at 37°C with 
the SIN- 1 vanant whether or not DI panicles are present in the virus inoculum. At six 
days post infection with SIN-1, the BHK cells were completely resistant to 
superinfection with wild-type Sindbis virus, demonstrating establishment of a persistent 
infection. However, these cells were susceptible to infection by the heterologous virus, 

15 VSV, demonstrating that interferon is not involved in the establishment of SIN-1 
persistent infection. 

CEF cells were infected (MOI = 10) with the plaque purified SIN-1 stock 
to generate a high titered stock of virus for the isolation of RNA. Ninety ml of culture 
fluid (2xlO lo PFU/ml) was clarified by centrifugation for 5 min at 2,500 rpm in a Sorvall 

20 GSA rotor. The SIN-1 virus was pelleted from the clarified culture fluid by 
centrifugation in an SW27.1 rotor for 2h at 24,000 rpm, at 4°C. The vims pellet was 
then resuspended in 3 ml of culture media, homogenized by repeated pipetting, and 
layered on top of a 25-40% (w/w) sucrose gradient. This was followed by 
centrifugation in an SW27.1 rotor for 4 h at 24,000 rpm, at 4°C. The virus band was 

25. visualized with incandescent light illumination, and collected with a 22 gauge 
needle/syringe. The RNA was purified further by Proteinase K (Boehringer Mannheim, 
Indianapolis, Indiana) digestion (56°C, 1 hr), followed by extraction with an equal 
volume of H,0-saturated phenolxhlorform (1:1 v/v, pH 7.0), followed by precipitation 
with 2 volumes of ethanol and 0.3 M sodium acetate, pH 5.2. 
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Alternatively, the SIN- 3 virus can be further purified by polyethylene 
glycol precipitation of infected BHK cell culture media (Strauss et al. Virology 
753:154-168, 1976), or alternatively by pelleting through a sucrose cushion (Polo et al. 
J, Virol. 52:2124-2133, 1988). 
5 Two rounds of first strand cDNA synthesis were performed with the 

purified viral RNA. using the Moloney murine leukemia virus or Superscript II reverse 
transcriptases (Gibco-BRL, Gaithersburg, Maryland) according to the manufacturer's 
recommended conditions. Six separate reactions were performed, using the Sindbis 
virus specific primers complementary to the positive strand (primers denoted by R) 
10 shown below. All primers denoted by R were phosphorytated at their 5' end with 
polynucleotide kinase (New England Biolabs) prior to the first strand synthesis reaction 
to facilitate cloning. 
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Primer 


Location 


Seq. ID No. 


Sequence (5' -> 3') 


Enzyme Site 


T7/IF 


Sac I/T7: 1-20 


2 


GGTGGAGCTCTAATACGACT 


Sac I 








CACTATAGATTGACGGCGTA 










GTACACAC 




1465R 


1465-1451 


3 


AATTTCTG CCTC AGC 


Eco 47 [II 


1003F 


1003-1019 


4 


TATGCAAAGTTACTGAC 


Eco 47 III 


2S23R 


2S23-2S06 


5 


CTGTCATTACTTCATGTC 


BspHl 


1003F 


1003-1019 


4 


TATGCAAAGTTACTGAC 


Eco 47 III 


4303R 


4303-42S9 


6 


GCGTGGATCACTTTC 


Avrll 


405 IF 


4051-4069 


7 


ATTGCGTGATTTCGTCCGT 


AvrW 


8115R 


81 15-8101 


S 


TAAATTTGAGCTTTG 


Pmll 


1680F 


1680-16S9 


9 


GGC ATA TG G C A TT A GTTG 


Bsp HI 


8115R 


8115-8101 


8 


TA A ATTTG A GCTTTG 


Pmll 


8034F 


8034-8052 


10 


CTGGCCATGGAAGGAAAGG 


Pmll 


1 1.703R 


Xho I/dT,/ 


] 1 


CCCCTCG AGGGT(2 1 )GAAATG 


Xho I 




11703-11677 




TTAAAAACAAAATTTTGTTG 





Synthesis of the second strand from the cDNA template above was 
accomplished in six separate reactions with the Klenow fragment of DNA polymerase I 
(New England Biolabs, Beverly, MA) according to the manufacturer's recommended 
5 conditions. Sindbis virus specific primers complementary to the negative strand 
(primers denoted by F shown above) were used. The double-stranded DNA products 
were substituted, stepwise, for the corresponding regions in the plasmid Toto 1101, 
which contains the full-length Sindbis virus genome (Rice et al., J. Virol. 67:3809-3819, 
1987). For example, the T7/1F-1465R product was digested with Sac I and Eco 47III, 
10 and inserted into Sac l/Eco 47III digested and CIAP treated TotollOI plasmid, which 
was purified away from the Sac UEco 47III small fragment (SP6 promoter, Sindbis 
virus nts. 1-1407) by 1% agarose/TAE (50X/liter: 242 g Tris base/57.1 ml glacial acetic 
acid/100 ml 0.5 M EDTA pH 8.0) electrophoresis, and GENECLEAN II. This 
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construct was then digested with Eco 47I1I and Avr II (Sindbis virus nt nos. 1407 and 
4281. respectively), treated with CIAP. and followed by insertion of the 3300 bp 
fragment isolated from the 1003F-4303R product, digested with Eco 47 III and Avr II. 
The fully assembled clone is designated as pRSIN-lg (g, as a reference to full-length 
5 genomic clone), and contained all 1 1,703 bp of viral genome. A subset of the primers 
listed in the table above generate redundant double-stranded DNA reaction products 
within the SIN-1 genome. For example, the sequences in the 4051F/81 15R product are 
within the 1 6S0F/S 1 15R product. These redundant products are provided as 
construction alternatives for the SIN-1 genomic clone; i.e., in general, the efficiency of 

10 cDNA cloning is inversely proportional to the length of the desired fragment. 

To done portions of the viral genome not obtained by the above method, 
the SIN-1 RNA viral genome was cloned by reverse transcription polymerase chain 
reaction (RT-PCR). First strand synthesis was accomplished as described above. PCR 
amplifications of Sindbis cDNA with the primer pairs shown above were performed as 

15 separate reactions, using the Klemaql enzyme, and the reaction conditions, as described 
in Barnes {Proc. Natl. Acad. Sci. USA 97:2216-2220, 1994). Alternatively, the 
Thermalase thermostable DNA polymerase (Amresco Inc.. Solon, OH) was substituted 
for the Klentaq 1 enzyme, using a buffer containing 1 .5 mM MgCl. that was provided 
by the supplier. Alternatively, the VentR thermostable DNA polymerase (New England 

20 Biolabs, Beverly, MA) was used in the amplification reactions. Additionally, the 
reactions contained 5% DMSO and "HOT START WAX" beads fPerkin-Elmer). The 
PCR amplification protocol used is shown below (The 72°C extension incubation 
period was adjusted to 1 min per 1 kb of template DNA): 
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Temperature (°C) 


Time (Min.) 


No. Cycles 


(a) 


95 


2 


1 


(b) 


95 


0.5 






55 


0.5 


35 




72 


3.5 




(c) 


72 


10 


1 



Alternatively, cloning of the full-length SIN-1 RNA genome can be 
performed similar to methods which have previously been described (Dubensky et al, 
J. Virol. 70:508-519, 1996). Briefly, first strand cDNA synthesis is accomplished with 
5 a mixture of random hexamer pnmers (50 ng/ml reaction concentration) (Invitrogen, 
San Diego, California), and primer 4B, whose sequence is shown in the table below. 
Genomic length Sindbis virus SIN- 1 variant cDNA is then amplified by PCR. Six 
distinct segments using six pairs of overlapping primers is sufficient to clone the entire 
genome. In addition to viral complementary sequences, the SIN-1 5' end forward 

10 primer contains a 19 nucleotide sequence corresponding to the bacterial SP6 RNA 
polymerase promoter and the Apa I restriction endonuclease recognition sequence 
linked to its 5' end. The bacterial SP6 RNA polymerase is poised such that transcription 
in vitro results in the inclusion of only a single non-viral G ribonucleotide linked to the 
A ribonucleotide, which corresponds to the authentic Sindbis virus 5' end. Inclusion of 

15 the Apa I recognition sequence facilitates insertion of the PCR amplicon into the 

piasmid vector pKS IP (Stratagene, La Jolla, California) polylinker sequence. A five 
nucleotide 'buffer sequence' extension is also linked upstream from the Apa I 
recognition sequence in order to facilitate efficient enzyme digestion. The sequence of 
the SP6-5' SIN-1 forward primer and all of the primer pairs necessary to amplify the 
20 entire SIN-1 genome are shown in the table below. (Note that "nt" and "nts" as utilized 
hereinafter refer to "nucleotide" and "nucleotides," respectively). The reference 
sequence (GenBank accession no. J02363, locus: SINCG) is from Strauss et al, 
Virology /JJ:92-110, 1984. 
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Pnmer Location Seq. ID No. Sequence (5'->3') Recognition 

Sequence 



SP6-1 A 


Apa I/SP6/ 


12 


TATATGGGCCCGATTTAGGTGAC 


Apa I 




SIN nts.l-lS 




ACTATAGATTGACGGCGTAGTAC 










AC 




IB 


318^-3 160 


13 


CTGGCAACCGGTAAGTACGATAC 


Age I 


2A 


3144-3164 


14 


ATACTAGCCACGGCCGGTATC 


Age I 


2B 


5905-5585 


15 


1 LL 1 L 1 1 1 CuALU 1 0 1 LurAUL 


t CO KJ 


3 A 


5844-5864 


16 


ACCTTGGAGCGCAATGTCCTG 


Eco RJ 


7349R 


7349-7328 


17 


CCTTTTCAGGGGATCCGCCAC 


5am HI 


7328F 


7328-7349 


18 


GTGGCGGATCCCCTGAAAAGG 


Bam HI 


3B 


93S5-9366 


19 


TGGGCCGTGTGGTCGTCATG 


5c/ 1 


4A 


9336-9356 


20 


TGGGTCTTCAACTCACCGGAC 


5c/ 1 


I0394R 


10394-10372 


21 


CAATTCGACGTACGCCTCACTC 


55/ WI 


I0373F 


10373-10394 


22 


GAGTGAGGCGTACGTCGAATTG 


Bsi WI 


4B 


Xbal/T .' 


23 


TATATTCTAGA(T 35 )GAAATG 


A7jg I 




1 1703-1 i 698 









PCR amplifications of Sindbis cDNA with the primer pairs shown above 
are performed as separate reactions, using the Thermalase or Vent R DNA polymerases 
(cited above), reaction conditions, and the PCR amplification conditions, as described 
5 above. 

The regions of sequence overlap between the amplification products 
correspond to unique enzyme recognition sites within the PCR amplicon. The PCR 
products are purified (QIAquick PCR purification kit, Qiagen, Chatsworth, California) 

and inserted stepwise into the pKS II + vector, between the Apa I and Xba I sites. The 
10 fully assembled clone is designated as pKSRSIN-lg (g, as a reference to full-length 
genomic clone), and contains all 1 1,703 bp of viral genome. 

C. Sequence of the SIN-1 phenotvpe 

The SIN-1 specific nucleotide sequences of the pRSIN-lg clone was 
15 determined by the dideoxy-chain termination method. Sequence comparison of 8,000 
bp of viral sequence revealed multiple differences between the SIN-1 clone described 
herein and the Sindbis virus (strain HRsp) sequence provided in GenBank (GenBank 
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Accession no. J02363, locus: SINCG). Differences in the sequence among SIN-1 
(Figure 6), SINCG (Figure 7), and Totol 101 (Figure 8) are presented below. 



nt. Position 


Gene 




SINCG 


Toto 1101 




SIN-1 






nt 


aa 


nt 


aa 


nt 


aa 


45 


5'NTR 


T 


-- 


T 


— 


C 


-- 


120 


nsPl 


C 


Gin 


C 


Gin 


A 


Lys 


1775 


nsP2 


G 


null 


G 


null 


A 


null 


1971 


nsP2 


T 


Phe 


T 


Phe 


C 


Leu 


2992 


nsP2 


C 


Pro 


T 


Leu 


T 


Leu 


3579 


nsP2 


A 


Lys 


G 


Glu 


G 


Glu 


3855 


nsP2 


C 


Pro 


C 


Pro 


T 


Ser 


3866 


nsP2 


C 


null 


C 


null 


T 


null 


4339 


nsP3 


A 


Glu 


A 


Glu 


T 


Vai 


4864 


nsP3 


C 


Ser 


C 


Ser 


T 


Phe 


5702 


nsP3 


A 


null 


T 


null 


T 


null 


5854* 


nsP4 


G 


Arg 


G 


Arg 


A 


His 


7612 


junction 


A 




A 




T 




7837 


Capsid 


C 


Arg 


C 


Arg 


T 


Cys 



5 * This mutation was found in one cDNA clone. It was not detected when the Sin- 1 virus 
RNA was sequenced. It likely represents a minor species in the RNA population. 

Verification that the sequence changes were unique to the clone (and not 
the result of cloning artifact) described herein, was determined by amplifying SIN-1 
10 virion RNA by RT-PCR as described above, establishing the sequence containing the 
nucleotides in question by direct sequencing of the RT-PCR amplicon product, and 
comparing the sequence to the corresponding SIN-1 sequence. 

D. Characterization and genetic mapping of the S IN-1 phenotvpe with molecular 
15 clones: 

Various regions of the SIN-1 genome were substituted for the 
corresponding wild-type Sindbis virus region in the Totol 101 plasmid (Rice et al., J. 
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Virol. 67:3809-3819, 1987) in order to map the location of the phenotype for 
establishment of persistence. The various SIN-1 nsP genes were substituted into the 
Toto 1101 wild-tvpe Sindbis virus background using restriction enzyme fragments 
purified from pRSIN-lg, as illustrated in the table below. 



nsP Gene 


Restriction Fragment 


Nucleotide 
Coordinates 


Clone Designation 


nsPl 


Pie MEco 47III 


98-1407 


pRSIN-lnsPl 


nsP2 


Eco41\WAvrll 


1407-4281 


pRSIN-lnsP2 


nsP2-N terminus 


Eco 41UVBgl II 


1407-22S9 


pRSIN-lnsP2-N 


nsP2-C terminus 


BglWAvr II 


2289-4281 


pRSIN-lnsP2-C 


nsP3-4 


Avr WEcoRl 


4281-5870 


P RSIN-lnsP3 


nsP3 


AvrWSpe I 


4281-5262 


pRSIN-lnsP3 


nsP4 


Spe VAat 11 


5263-5870 


pRSIN-lnsP4 


nsPl-4 


Pie VAat II 


98-8000 


pRSIN-lnsPl-4 



The coordinates of the nonstructural gene coding regions are provided in the following 
table: 



nsP Gene Coordinates of Sindbis 

virus genome (nt. no.) 

^sTl 60-1680 

nsP2 1680-4101 

nsP3 4101-5769 

nsP4 5769-7597 

nsPl-4 60-7597 



The various SIN-1, Toto, and chimeric SIN-1 /Toto clones, pRSIN-lg, 
Toto, pRSIN-lnsPl, pRSTN-lnsP2 ? pRSIN-lnsP2-N, pRSIN-lnsP2-C f pRSIN-lnsP3, 
pRSIN-lnsP4, and pRSIN-lnsPl-4 were linearized by digestion with Xho I, which 
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makes a single cut in the cDNA clones immediately adjacent and downstream of a 21 
nucleotide poly dA:dT tract following the Sindbis virus 3' end (viral nt. 11703). The 
linearized clones were purified with GENECLEAN II (BIO 101, La Jolla. California), 
and adjusted to a concentration of 0.5 ug/ul. Transcription of the linearized clones was 
5 performed in vuro at 40°C for 90 minutes according to the following reaction 
conditions: 2 ul DNA/4.25 ul H 2 0); 10 ul 2.5 raM NTPs (UTP, ATP, GTP, CTP); 
1.25 ul 20 mM Me7G(5')ppp(5')G cap analogue; 1.25 ul 100 mM DTT; 5 jal 5X 
transcription buffer (Promega, Madison Wisconsin); 0.5 jil RNasin (Promega); 0.25 ul 
10f!g/fil bovine serum albumin; and 0.5 ul T7 RNA polymerase (Promega). The 

10 in vitro transcription reaction products were digested with DNase I (Promega), purified 
by sequential phenol: CHC1 3 and ether extraction, and followed by ethanol precipitation. 
Alternatively, the in vitro transcription reaction products can be used directly for 
transfection. The in vitro transcription reaction products or purified RNA were 
complexed with a commercial cationic lipid compound (LIPOFECTIN, GIBCO-BRL, 

15 Gaithersburg, MD) and applied to Baby Hamster Kidney-21 (BHK-21) cells maintained 
in a 60 mm petri dish at 75% confluency. Alternatively, BHK cells were electroporated 
with the in vitro transcription reaction products or purified RNA, exactly as described 
previously (Liljestrom, Bio/Technology 9: 1356- 1361, 1991). The transfected ceils were 
incubated at 37°C. At 48 hours post-transfection. culture media were collected and the 

20 titer of each virus was determined by plaque assay, as described above. The titered 
virus stocks derived from these in vitro transcription reactions were designated as 
shown in the table below. 
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Clone Designation 


Virus Designation 


pRSIN-lnsPl 


SIN-lnsPl 


pRSIN-lnsP2 


SIN-lnsP2 


pRSIN-lnsP2-N 


SIN-lnsP2-N 


P RSIN-lnsP2-C 


SIN-lnsP2-C 


pRSIN-!NSp3-4 


SIN-lnsP3-4 


pRSIN-insP3 


SIN-lnsP3 


pRSrN-lnsP4 


SIN-lnsP4 


pRSIN-lnsPl-4 


SIN-lnsPl -4 


Toto 1 101 


Toto 



To map the SIN-1 persistent phenotype, 8 x 10 s BHK cells were infected 
(MOI=5) with each of the virus stocks prepared above. At 3 days post infection, the 
culture viability was determined by trypan blue dye exclusion. The results of this 
5 experiment (shown below), demonstrate that the SIN-1 phenotype of establishing 
persistent non-cytocidal infections maps to the nonstructural genes, and to nsP2 gene in 
particular. The number of cells in the mock-infected culture represents continued 
growth of these cells until they reached the stationary phase. At 3 dpi, cells infected 
with SIN-lnsPl, SIN-lnsP3. SIN-l-nsP3-4 and SIN-insP4 had all died. The cells that 
10 survived infection with SIN-lnsP2 and SIN-lnsPl -4 continued to grow and were 
persistently infected based on staining with antibodies specific for Sindbis virus. 
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Virus 


Number of Cells at 3 dpi 


SIN-lnsPl 


A 

u 


SIN-lnsri 


3 x 10 


SIN-lnsP3-4 


0 


SIN-lnsP3 


0 


SIN-lnsP4 


0 


SIN-1 nsP 1-4 


5 x 10 ; 


Toto 


0 


Mock 


1 x 10" 



As shown above, the observed SIN-1 phenotype of establishing non- 
cytocidal persistent infections maps to the viral nsPs. as opposed to the sPs. This 
conclusion was demonstrated clearly by comparison of cell survival levels between 
5 cultures infected with the Toto or SIN-lnsPl-4 virus stocks. Both the Toto and SIN-1 
nsPl-4 viruses contain the wild-type sPs; cell survival was observed, however, only in 
those cultures infected with the virus (SIN-lnsPl-4 ) containing nsPs derived from the 
SIN-1 clone. In these experiments, cell survival was not dependent upon the source of 
the Sindbis virus sPs. Importantly, the SIN-1 phenotype was mapped further to nsP2. 

10 The level of cell survival was comparable between cultures infected with the SIN- 
InsPl-4 or SIN-lnsP2 viruses. Further, a C -> T transition at nucleotide 3855, in the 
SIN-1 nsP2 gene is responsible for the characteristic phenotype of establishment of 
persistent infection in cells infected with the SIN-1 virus. The single proline to serine 
change in the nsP2 protein produced in cells infected with the chimeric virus SIN- 

15 lnsP2-C, was all that was required to convert wild-type Sindbis virus (Toto 1101) from 
a virus that killed all of the infected cells into a virus which permitted many of the 
infected cells to survive and continue to produce virus. The phenotypes of chimeric 
viruses derived from insertion of the SIN-1 nsPl, nsP3, or nsP4 genes into the Toto 
background were indistinguishable from wild-type and complete lysis was observed in 

20 cultures infected with these viruses. 
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The possible effect of amino acid changes in the SIN-1 nsPs on the level 
of productive infection in BHK cells was determined by comparing the virus yield over 
a time course in BHK cells inoculated with the various SIN-1, wild-type (Toto), or SIN- 
1/Toto chimeric strains. Briefly, BHK cells were infected (MOI=20) with SIN- 1 
5 (plaque purified stock described above), Toto, SIN-lnsPl-4, or SIN-lnsP2 viruses, and 
the culture fluids were collected at 3, 6, 9. and 12 hours post infection. The titers of 
virus in the culture fluids were then determined by plaque assay, as described above. 
The results of this study, shown in Figure 2. demonstrate that equivalent levels of virus 
were produced in BHK cells infected with wild-type or SIN-1 strains. The actual virus 
10 titers at the 12 hpi time point are set forth in the table below. More than half of the 
BHK cells survived infection with SIN-1 virus (and chimeric viruses containing SfN-1 
nsPsl-4, or SIN-1 nsP2) in combination with levels of virus production equivalent to 
wild-type strains. 



Virus Titer (PFU/ml) at 12 hpi (x 10 ) 
SIN-1 J2 
Toto 2.7 
SIN-lnsPl-4 1.8 
SIN-lnsP2 2.6 



15 



The possible effect of amino acid changes in the SIN-1 nsPs on the level 
of viral-specific RNA synthesis was determined by comparing the level of [ 3 H]-uridine 
incorporation over a time course in BHK cells inoculated with the various SIN-1, wild- 
type (Toto), or SIN-l/Toto chimeric strains. BHK cells (3 x 10 5 celis/35 mm dish) were 

20 grown at 37°C, according to the conditions described herein. The cells were infected 
(MOI = 20) with SIN-1, TotollOl, SIN-lnsPl-4, or SIN-lnsP2 viruses. At 30 min. 
post infection, the culture medium was adjusted to 1 jag/ml actinomycin D. After 
incubation for an additional 30 min. the culture medium was adjusted to 10 u,Ci/ml 
[ 3 H]-uridine. At 3, 6, 9, and 12 hpi, the treated cells were washed with PBS, and lysed 

25 by addition of 200 ul of TTE buffer (0.2% Triton X-100, 10 mM Tris-HCl, pH 8.0, 1 
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mM EDTA). The RNA was precipitated at 4°C by addition of 200 |il of a 25% 
trichloroacetic acid (TCA) solution. The RNA was pelleted by microcentrifugation for 
5 min at 14.000 rpm, rinsed once with 5% TCA, and dissolved in a solution consisting 
of 50 mM NaOH/0.1 % SDS. at 55°C. The solution containing the dissolved RNA was 
5 transferred into scintillation vials and the level of incorporated [ 3 H]-uridine was 
determined using EcoLume (ICN, Irvine, CA) scintillation fluid. The results of this 
study, shown in Figure 3, demonstrate that levels of virus-specific RNA were 
dramatically lower in BHK cells infected with SIN-1 compared to wild-type virus. This 
phenotype of low level of virus-specific RNA synthesis maps to the nsPs, as shown by 
10 the equivalently low levels of RNA produced in BHK cells infected with the SIN-1 or 
SIN-lnsPl-4 strains, compared to wild-type. 



Virus [ 3 H]uridine incorporation (x 1 0 J cpm) 

SIN-1 22 
TotollOl 86 
SIN-lnsPl-4 28 
SIN-lnsP2 62 
Mock 8 



Virus-specific RNA synthesis in infected BHK cells was also determined 
15 using all of the SIN-1, Toto, and SIN-l/Toto chimeric, strains. The levels of 
[ 3 H]uridine incorporation, relative to wild-type infection (Toto) at 9 hpi, are shown in 
Figure 4, and given in the table below. 
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Virus No. of Experiments Relative RNA Standard Deviation 

Synthesis Level 



TotollOl 


9 


1.0 




SIN-1 


9 


0.1 


±0.1 


SIN-lnsPl-4 


3 


0,2 


±0.1 


Jllx 1 Hoi i 


7 


1.0 


±0.2 


SIN-lnsP2 


S 


0.6 


±0.1 


SIN-lnsP3 


4 


1.0 


±0.0 


SIN-lnsP3-4 


7 


0.4 


±0.1 


SIN-lnsP4 


6 


0.8 


±0.1 


SIN-lnsP2-N 


1 


0.9 




SrN-lnsP2-C 


3 


0.6 


±0.2 


Mock 


9 


0.0 





Thus, BHK cells survive infection with SIN-1 virus (and chimeric viruses containing 
SIN-1 nsPsl-4, or SIN-1 nsP2), SIN-1 virus levels equivalent to wild-type strains are 
produced in BHK ceils, and, the level of viral-specific RNA synthesized is 10-fold 
5 lower compared to wild-type virus. 

The possible effect of amino acid changes in the SIN-1 nsPs on the level 
of inhibition of host cell protein synthesis was determined by comparing the level of 
protein synthesis, relative to uninfected cells, over a time course in BHK cells 
inoculated with the various SIN-1, wild-type (Toto), or SIN-l/Toto chimeric strains. 

10 Briefly, 35 mm dishes seeded with 2 x 10 ? BHK cells were infected (MOI - 20) with 
the various Sindbis virus strains in a 0.5 ml virus inoculum in a buffer consisting of 
PBS/1% fetal calf serum. The plates were incubated at 4°C for 1 hr with continuous 
gentle shaking. The inoculum was replaced with 2 ml of medium, described previously, 
containing 10% fetal calf serum, and the dishes were placed in a CO, incubator at 37 C C. 

15 At 5, 8, and 1 1 hi later, the media was replaced with 2 ml of MEM lacking methionine 
(Met-), with 2% fetal calf serum, and incubated 30 min. The medium was then replaced 
with 1 ml of MEM (Met-) containing 2% fetal calf serum and 10 uCi/ml of 
[ J5 S]methionine. Following a 30 min incubation period at 37°C, 1 ml of medium 
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containing 10% fetal calf serum was added to each well, and the dishes were incubated 
for another 30 min at 37°C. This dilution is sufficient to inhibit further incorporation of 
radioactive label into protein and significantly decreases the background of free [ 3S S]- 
methionine detected in polyacryiamide gels. The medium was then removed from the 

5 well Cells were washed three times with PBS, scraped from the dish into PBS, pelleted 
by centrifugation. and dissolved in 25 ul of loading buffer (0.06 M Tris-HCl, pH 6.7, 
2% SDS, 5% p-mercaptoethanol. 5% glycerol, 0.05% bromophenol blue). One-fifth of 
the sample was analyzed on the gel. After electrophoresis, the gels were stained with 
Coomassie brilliant blue R. dried, and autoradiographed. The rates of inhibition of host 

0 cell protein synthesis were compared by quantitating the amount of radioactivity in the 
section of the gel containing only host proteins. The results of this study are shown in 
Figure 5, and demonstrate that the level of inhibition of host cell protein synthesis is 
significantly lower in SIN-l virus infected cells, compared to wild-type virus infected 
cells, particularly at the earlier 6 and 9 hour time points post infection. 

5 In summary, BHK cells survive SIN-l infection, virus levels equivalent 

to wild-type strains are produced in BHK cells, the level of viral-specific RNA 
synthesized is 10-fold lower than wild-type virus, and the level of inhibition of host cell 
protein synthesis in SIN-l virus infected cells is significantly lower compared to wild- 
type virus infected cells. The phenotypes of the SIN-l virus described herein map to 

0 the viral nsP genes. 

EXAMPLE 2 

Isolation and Characterization of Positive Strand RNA Viruses Which 
5 Exhibit Reduced Inhibition of Host Macromolecular Synthesis 

The derivation of virus variants exhibiting the desired phenotypes of 
reduced, delayed or no inhibition of host cell macromolecular synthesis is dependent on 
the generation, characterization, and isolation of sequences which differ from that of 
0 wild-type virus. However, in addition to example 1, there are no obvious or previously 
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disclosed methods to select for or identify coding or non-coding viral sequence changes 
that result in alteration of this virus-based inhibition of macromolecular synthesis, or the 
generation of viruses that lead to persistent, rather than lytic, infection. The present 
invention provides specific methods, using alphaviruses as an example, that enable one 
5 to overcome these obstacles. 

A. Biological Selection of Virus Variants 

The biological derivation of virus variants which result in reduced, 
delayed, or no inhibition of host macromolecular synthesis, or which establish persistent 

10 infections, can be performed by allowing for natural, spontaneous mutation within a 
cell, or first subjecting the desired virus stock to physical, chemical or other artificial 
mutagenesis, followed by infection of susceptible cells, and successive enrichments for 
those cell populations which harbor mutated virus. It is possible that prior mutagenesis, 
although not required, will facilitate the generation of appropriate mutations. The 

15 selection is based on the ability of cells infected with the desired variant to survive for 
significantly longer periods than wild-type virus infected cells. The following examples 
provide representative methods in detail, using Sindbis virus as an example; however, 
other viruses are readily substituted, as noted in the detailed description. 

Specifically, in the case of chemical mutagenesis, a Sindbis virus stock 

20 suspension with a titer of greater than or equal to 10 9 pfu/ml is treated with 
nitrosoguanidine at a final concentration of 100 ug/ml. After 15 minutes at room 
temperature, the nitrosoguanidine is removed by dialysis at 4°C and the mutagenized 
stock is subsequently used for infection. Approximately 5xl0 6 ceils of the desired type 
(for example BHK-21), grown in flat stock culture, are infected with the mutagenized 

25 virus at a multiplicity of infection (M.O.I.) of approximately 5, to ensure that every cell 
is infected.. At 12. 24, 36, and 48 hours post-infection, the cell monolayer is washed 
twice with fresh media to remove dead cells, and replaced with media consisting of a 
mixture of 50% fresh media and 50% conditioned media. After a desired time post- 
infection (for example 72 hours), the remaining cells are gently trypsinized to detach 

30 them from the culture dish and strip away any cell-associated extracellular virus, which 
is then separated from the cells by differential centrifugation. The remaining cells are 
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then re-seeded directly into a tissue culture dish containing a semi-confluent uninfected 
cell monolayer, at a ratio of 1 infected cell for every 10 3 uninfected cells. Additional 
rounds of selection are performed by seeding onto uninfected cells for amplification, 
followed by the above washing and harvesting steps. Alternatively, the initial 
5 infections may be done using a wild-type virus stock at low M.O.I., allowing for 
spontaneous mutation during replication within the cell. Using this approach, a 
heterogenous population of mutant virus is produced by the infected cells. In those 
instances where the population of infected cells recovered after trypsinization includes 
a significant number of non-viable or severely damaged cells, a brief treatment (5 

10 minutes at room temperature) with 0.75% NH 4 C1 in L7 mM Tris (pH 7.65 with HC1), or 
centrifugation through Percoll™, is included to remove remaining dead or damaged 
cells, prior to re-seeding onto uninfected cell monolayers. After a minimum of two 
successive rounds of selection, virus variants displaying the desired phenotype are 
isolated by limiting dilution or plaque purification, and subjected to cDNA cloning as 

15 described in Example 1. Isolation and characterization of specific sequences 
responsible for the variant phenotype are accomplished by substitution of defined 
regions of cDNA into a genomic clone or expression vector and testing for the 
accompanying phenotypic change, as outlined in the Examples. 

20 B. Genetic Selection of Virus Variants 

In a related approach, natural mutation or random mutagenesis is 
performed, not on a virus stock, but rather, using cloned genomic cDNA of the virus 
that can be transcribed into infectious viral RNA in vitro or in vivo. For example, in the 
case of prior mutagenesis, plasmid pRSINg, which contains a full-length genomic 

25 Sindbis cDNA functionally linked to a bacteriophage SP6 promoter (Dubensky, et ah, 
J. Virol. 70:508-519, 1996), is transformed into competent E. coli XLl-Red mutator 
cells and plated on ampicillin plates to obtain colonies. At least 200 colonies are 
chosen at random, pooled, and inoculated for overnight growth in a 10 mi broth culture 
containing ampicillin. Plasmid DNA is prepared from the culture to obtain a 

30 heterogeneous population of pRSINg harboring various mutations. The DNA is 
linearized with Xba I and transcribed in vitro using SP6 polymerase, as described 
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previously (Rice et aL /. Virol (57:3809-3819, 1987). RNA transcripts are 
subsequently transfected into the desired cell type (for example, BHK cells) by 
electroporation (Liljestrom and Garoff, Bio/Technology 9:1356-1361, 1991), for 
initiation of the Sindbis virus infection cycle. Alternatively, in vitro transcribed RNA 

5 from an unmutagemzed template also may be transfected. Selection of virus mutants 
which establish a persistent infection or exhibit reduced, delayed, or no inhibition of 
host macromolecular synthesis is performed as above. The selected virus variants with 
the desired phenotype are isolated by limiting dilution or plaque purification, subjected 
to cDNA cloning, and the sequences responsible for the variant phenotype are isolated 

1 0 and characterized, as described previously. 

C. Genetic Selection of Variants Using Virus-Derived Vectors 
1. Vectors Expressing an Immunogenic Protein 

In another approach, spontaneous intracellular mutation, or random 

15 mutagenesis is performed on virus-derived sequences of a viral-based expression vector. 
These sequences include non-coding and regulatory regions, as well as nonstructural 
protein encoding regions. In certain instances, structural protein-encoding sequences 
also may be included. Such random mutagenesis or spontaneous intracellular mutation 
may be performed using any of the techniques described in this invention, along with 

20 the cloned cDNA of a virus-derived vector which can be transcribed into RNA in vitro 
or in vivo. For example, a replication-competent Sindbis virus expression vector may 
be used to express an immunogenic cell surface protein or other peptide which may be 
bound by specific antibodies added to the infected cells. Cells which contain functional 
vector are identified by their expression of the vector-encoded heterologous antigen and 

25 ability to be bound by antibody specific for the encoded antigen. By limiting the 
selection process to cells surviving for extended periods (see above), only those 
harboring vector variants exhibiting the desired phenotype are enriched. 

Specifically, in the case of random mutagenesis, plasmid pTE3'2J (Hahn 
et al., J. Virol. <SP:2679-2683, 1992), comprising an SP6 promoter operably linked to a 

30 full-length genomic Sindbis cDNA with a duplicated subgenomic promoter for 
expression of heterologous genes, is mutagenized as described above. This process 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



116 

results in isolation of a population of heterogeneous plasmid containing the random 
mutations. In parallel, the desired heterologous cell surface protein or marker peptide 
gene is cloned into a shuttle vector for insertion into the mutagenized pTE3'2J vector. 
Preferred cell surface proteins for use as markers include, but are not limited to, human 
5 B7.1 (Freeman et ai., J. Immunol. 1 4 3:21 1 4-27 r 22, 1989) and the murine H-2K b class I 
molecule (Song et at., J. Biol. Chem. 269:1024-7029, 1994). The human B7.1 gene is 
amplified by standard three-cycle PCR, with 1.5 minute extension, from a pCDM8 
vector containing the full-length cDNA sequence (Freeman et al., ibid), using the 
following oligonucleotide primers that are designed to contain flanking Xba I and Bam 
10 III sites. 



Forward primer: hB7.1 FX C5'-rest. site/B7,i sequence) fSEO. ID. NO. 24) 
5'-ATATATCTAGA/GCCATGGGCCACACACGGAGGCAG-3' 

1 5 Reverse primer: hB7.1 RB (g'-rest. site/B7.1 sequence) (SEP. ID. NO. 25) 
5'-ATATAGGATCC/CTGTTATACAGGGCGTACACTTTC-3' 

Following amplification, the approximately 875 bp DNA fragment is 
purified using a QIAquick-spin PCR purification kit (Qiagen. Chatsworth, CA), 

20 digested with Xba I and Bam HI and ligated into Sindbis shuttle vector pH3'2Jl (Halm 
et al., ibid) that also has been digested with Xba I and Bam HI and treated with calf 
intestinal alkaline phosphatase, to create the construct pH3'B7.1 . 

Following random mutagenesis of the pTE3'2J double subgenomic 
vector, as described above, plasmid pH3'B7.1 and the mutated population of plasmid 

25 pTE3'2J are digested with Apa I and XJw I, purified from 0.7% agarose gels using 
GENECLEAN II II™ (BiolOl, Vista, CA), and ligated to form a heterogeneous 
population of a B7.1 expression vector, designated pTE3'B7.1. Without transforming 
E. coli and isolating individual clones, the entire population of ligated vector is 
linearized with A7io I and used as template for in vitro SP6 transcription reactions, as 

30 described above. The heterogeneous population of randomly mutagenized B7.1 vector 
transcripts is then electroporated into the desired cell type (for example, BFIK cells) for 
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initiation of the Sindbis virus replication cycle. Selection for virus mutants which 
establish a persistent infection or exhibit reduced, delayed, or no inhibition of host 
macromolecular synthesis is performed using a monoclonal antibody specific for B7.1 
(Pharmmgen, San Diego, CA) and either magnetic- or fluorescence-activated cell 
5 sorting protocols. The preferred secondary antibody tags include rat-anti-mouse IgG 
conjugated with magnetic microbeads for magnetic cell sorting (miniMACS Magnetic 
Separation System. Miltenyi Biotec, Auburn, CA; Miltenyi et al., Cytometry 77:231- 
238, 1990), and FITC-conjugated rat anti-mouse IgG (Pharmingen, San Diego, CA) for 
fluorescence activated cell sorting. Using such an approach and harvesting cells after 
10 an extended period {see above), only viable cells which contain a functional virus- 
derived vector {as evidenced by B7.1 expression), displaying the desired phenotype, are 
enriched. 

Specifically, the heterogeneous population of randomly mutagenized 
B7.1 vector transcripts is electroporated into lxlO 7 cells, according to the procedure of 

15 Liljestrom and Garoff (1991, ibid), and plated as a flat stock culture. At 12, 24, 36, and 
48 hour post-infection, the cell monolayer is washed twice with fresh media to remove 
dead cells, and replaced with media consisting of a mixture of 50% fresh media and 
50% conditioned media. After a desired time post-infection (for example 72 hours), the 
remaining cells are gently trypsinized to detach them from the culture dish, and pelleted 

20 by centrifugation at 1000 rpm, 4°C. The cells are resuspended in 2 ml of blocking 
solution (PBS + 10% fetal calf serum + 1% BSA), incubated on ice for 10 minutes, and 
re-pelleted. Next, the cells are resuspended in 200 ul of the primary anti-B7.1 antibody 
solution (diluted in PBS + 0.5% BSA), and incubated on ice for 30 minutes. The cells 
are washed twice with PBS + 0.5% BSA, pelleted, and resuspended in 200 ul of 

25 magnetic bead solution (200 ul washed magnetic rat anti-mouse coated beads in PBS + 
0.5% BSA + 5 mM EDTA). Following incubation at 4°C for 30 minutes, the bead- 
bound cells are washed twice with PBS + 0.5% BSA + 5 mM EDTA, and resuspended 
in 1 ml of the same buffer. The bead-bound cells are then purified using the MiniMacs 
magnet column, according to the manufacturer's directions. The eluted positive cells 

30 are then re-seeded directly into a tissue culture dish containing a semi-confluent 
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uninfected cell monolayer, at a ratio of 1 infected cell for every 10 4 uninfected cells. 
Additional rounds of selection are performed as above for amplification/enrichment. 
The selected vector variants with the desired phenotype are isolated by limiting dilution 
or plaque purification, subjected to cDNA cloning, and the sequences responsible for 
5 the variant phenotype are isolated and characterized, as described previously. 

Alternatively, the B7.1 expression vector, pTE3'B7.1 f may be used 
directly for in vitro transcription without prior mutagenesis. Following transfection of 
these RNA transcripts, selection for mutants of the desired phenotype is performed as 
described above. 

10 

2. Vectors Expressing a Selectable Marker 

Alternatively, an antibiotic resistance marker can be used for selection of 
virus vector variants exhibiting the desired phenotype (Figure 8F). For example, the 
Sindbis vector pRSIN-[igal (Dubensky et aL, ibid) was modified by replacement of the 

15 pgalactosidase reporter gene with a neomycin phosphotransferase selectable marker and 
either subjected to prior mutagenesis or used directly. The gene encoding neomycin 
(G418) resistance was isolated by standard three-cycle PCR amplification, with 1.5 
minutes extension, from plasmid pcDNA3 (Invitrogen, San Diego, CA). using the 
following oligonucleotide primers that were designed to contain flanking Xlio I and Not 

20 I restriction sites: 

Forward primer: NeoFX (5'-rest. site/neo sequence) (SEQ. ID. NO. 26) 
5'-ATATACTCGAG/ACCATGATTGAACAAGATGGATTG-3' 

25 Reverse primer: NeoRN fS'-rest. site/neo sequence) (SEQ. ID. NO. 27) 
5'-TATATAGCGGCCGC/TCAGAAGAACTCGTCAAGAAG-3' 

Following amplification, the DNA fragment was purified with 
QIAquick-spin, digested with Xlxo I and Not I, and ligated into pRSIN-pgal vector that 
30 also had been digested with Xlxo \ and Not I, treated with calf intestinal alkaline 
phosphatase, and purified from a 0.7% agarose gel, away from its previous 
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Pgalactosidase insert, using GENECLEAN II. The newly constructed Sindbis 
expression vector containing the neomycin resistance marker was designated pSin-Neo. 
Plasmid pSin-Neo was linearized with Pme I , either directly or with 1, 2, or 3 rounds of 
prior mutagenesis by passage through E. coli strain XL-1 Red. The linear DNA was 
5 used as template for in vitro SP6 transcription reactions and vector transcripts were then 
transfected into the desired cell type (for example, BHK cells) for initiation of the 
Sindbis replication cycle and heterologous gene expression. Approximately 24 hour 
post-transfection. the BHK cells were trypsinized and replated in media containing 0.5 
mg/'ml G418. Subsequently, the media was changed at approximately 24 hour intervals 

10 to remove dead cells, and replaced with G4l8-containing media consisting of a mixture 
of 50% fresh media and 50% conditioned media. Media changes were reduced after the 
majority of dead ceils were washed away, and cell foci began to form. At this time, all 
cells in control plates transfected with Sindbis vector RNA expressing only a reporter 
gene were killed by the drug. Using this selection, only viable cells which contained a 

15 functional Sindbis virus-derived vector, exhibiting the desired phenotype (as evidenced 
by neomycin resistance), were enriched. Stably transformed neomycin-resistant pools 
were obtained using this approach for both mutagenized and unmutagenized templates, 
and the pools were subsequently characterized. Similar selection approaches were 
demonstrated to work in cells that express higher levels of interferon(s), for example 

20 L929 cells. 

Mutant vector variants displaying the desired phenotype were isolated by 
harvesting RNA directly from the stably transformed cells using RNAzol B (Tel-Test, 
Friendswood, TX), followed by polyA selection. The RNAs were analyzed by northern 
blot, using a neomycin phosphotransferase gene probe, to demonstrate the presence of 

25 both genomic and subgenomic viral vector RNA species (Figure 8G). Lanes SI, S2, 
and S3 represent RNA from three independently derived G418-resistant pools. The 
BHK lane represents untransfected cellular RNA, while the Sin-Neo lane represents the 
original in vitro transcribed RNA vector. Clearly, significant differences in the ratios of 
genomic to subgenomic RNA are observed among the pools, suggesting their derivation 

30 from vectors containing different causal mutations. The isolated RNA also was used to 
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transfer the neomycin resistance phenotype to naive cells. Specifically, BHK-21 cells 
were transfected with the above isolated RNA or a control template RNA and treated 
with the G418 drug. Those cells transfected with the isolated RNA were immediately 
resistant to the drug and grew to confluence within days, unlike the control 
5 transfections. Finally, complementation assays were performed using a defective 
Sindbis virus f3-galactosidase vector (designated Sin-dl-pgal), which is deleted of 
nonstructural gene sequences between nucleotides 422 and 7054. Expression of the (3- 
galactosidase reporter from such a defective vector can occur only after the deleted 
nonstructural proteins are provided in trans, Transfection of Sin-dl-pgal RNA into the 

10 above G418-resistant pools resulted in expression levels of (B-galactosidase not seen in 
similarly transfected control BHK ceils. 

To map the genetic locus of the Sindbis vectors responsible for reduced 
cytopathogenicity. PCR primer pairs were synthesized, allowing division and gene 
substitution of the vector sequences (excluding the 3 '-end UTR) in three distinct 

15 sections: nts. 1 - 2288, nts. 2289 - 4845. and nts. 4846 - 7644 (Figure 8H). These 
primer pairs may be used to amplify cDNA directly from vector variant RNA isolated 
from G418-resistant cells. For example, RNA from the S2 pool (see Figure 8G) was 
subjected to cDNA cloning and substitution into a wild-type pSin-Neo vector. 
Following in vino transcription, wild-type and S2 mutant vector RNA was transfected 

20 into BHK cells and G418 drug selection was applied. Only the S2 mutant vector RNA 
resulted in rapid drug resistance and confluent cell growth within days, suggesting that 
the causal mutation resided within this region of gene replacement. Sequence analysis 
between the wild-type and S2 mutant vectors within this region, revealed a Proline to 
Threonine substitution at nsP2 amino acid 726, within the highly conserved alphavirus 

25 Leu-Xaa-Pro-Gly-Gly motiff. In addition, gene replacement of the same region with 
cDNA derived from pool SI (see Figure 8G) did not result in a vector producing similar 
G418 resistant cells (Figure 8H), nor did the substituted sequence contain a mutation 
within the conserved Leu-Xaa-Pro-Gly-Gly motiff. These data indicate that an 
alternative mutation outside the region of replacement is required for the observed 

30 phenotype. 
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Alphavirus Strain* 


Pro-Gly-GIy Region 


nsP2 a.a 


.'s(P-G-G) 


1. 


Sindbis virus 


Leu-Asn-Pro-GIy-Gly-Thr 


a.a. = 


726-728 


2. 


Sin-Neo S2 vector 


Leu-Asn-Ibx-Gly-GIy-Thr 


a.a. - 


726-728 


3. 


S.A.AR86 virus 


Leu-Asn-Pro-GIy-Gly-Thr 


a.a. = 


726-728 


4. 


Ockelbo virus 


Leu-Asn-Pro-Gly-Gly-Thr 


a.a. = 


726-728 


5. 


Aura virus 


Leu-Lys-Pro-Gly-Gly-Thr 


a.a. = 


725-727 


6. 


Semliki Forest virus 


Leu-Lys-Pro-Gly-Gfy-Ile 


a.a. = 


718-720 


7. 


VEE virus 


Leu-Asn-Pro-GIy-Gly-Thr 


a.a. = 


713-715 


8. 


Ross River virus 


Leu-Xaa-Pro-Gly-Gly-Ser 


a.a. = 


717-719 



In another example, a Semliki Forest virus-derived vector, pSFV-1 
(GIBCO/BRL), was used for insertion of the antibiotic resistance marker and 
5 subsequent selection of the desired phenotype. The gene encoding neomycin (G418) 
resistance was isolated by standard three-cycle PCR amplification, with 1.5 minutes 
extension, from plasmid pcDNA3 (Invitrogen. San Diego, CA), using the following 
oligonucleotide primers that were designed to contain flanking BamH\ restriction sites: 

10 Forward primer: 5'BAMHI-Neo (SEQ. ID. NO. 118) 

5 '-ATATAGGATCCTTCGC ATG ATTGAACAAGATGGATTGC-3 ' 



Reverse primer TBAMHI-Neo (SEQ. ID. NO. 57) 
15 5 ' -AT ATAGGATCCTC AG AAG AACTCGTC AAGAAGGCGA-3 ' 

Following amplification, the DNA fragment was purified with 
QIAquick-spin, digested with BamH I, and ligated into pSFV-1 vector that also had 
been digested with BamH I. treated with calf intestinal alkaline phosphatase, and 
20 purified from a 0.7% agarose gel, using GENECLEAN II. The resulting SFV vector 
construct containing the neomycin resistance marker was designated SFV-Neo. In vitro 
transcription of RNA vector from mutagenized or unmutagenized SFV-Neo DNA 
template was performed and transfection, followed by selection for mutants of the 
desired phenotype, was carried out essentially as described above for Sindbis virus 
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vectors. Several independently-derived, stably transformed G418-resistant pools were 
obtained and characterized. Northern blot analysis for two such pools. SF1 and SF2, 
are shown in Figure 8G, along with the original control RNA vector transcript (SFV- 
Neo). These data demonstrate that the selection methods described in the present 
5 invention have utility for multiple RNA virus vector systems. 

EXAMPLE 3 

Preparation of SINI-Based RNA vector replicons 

0 

A. Construction of the SIN-1 Basic Vector 

SIN-1 derived vector backbones were constructed and inserted into a 
plasmid DNA containing a bacteriophage RNA polymerase promoter, such that 
transcription in vitro produced an RNA molecule that acts as a self-replicating molecule 

5 (replicon) upon introduction into susceptible cells. The basic SIN-1 RNA vector 
replicon was comprised of the following ordered elements: SIN- 1 nsPs genes, 
subgenomic RNA promoter region, a polylinker sequence, which may contain 
heterologous sequence insertions, the SIN-1 3' non translated region (NTR), and a poly 
adenylate sequence. In addition, nsP genes of the desired phenotype, derived using 

0 methods such as those of Example 2, also may be substituted. Following transfection 
into susceptible cells, autonomous replication of the RNA vector replicon occurs as for 
virus, and the heterologous sequences are synthesized as highly abundant subgenomic 
mRNA molecules, which in turn serve as the translational template for the heterologous 
gene product. 

5 The 5' region of the vector, comprised of the SIN-1 nsP genes and 

subgenomic promoter, extends to within two nucleotides of the capsid gene 
translational initiation point. This region was first inserted into the pKSII+ plasmid 
(Stratagene) between the Apa I and XJw I sites. The 5' region of the vector was 
amplified by PCR from the pRSIN-lg plasmid in two overlapping fragments. The first 

0 fragment was generated in a PCR reaction with the following primer pair: 
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Forward primer: %P6AV(Ana I site/SP6 promoter/STN nts 1-18) : (SEQ. ID NO. 12) 
5'- 

TATATGGGCCCGATTTAGGTGACACTATAGATTGACGGCGTAGTACAC 

5 Reverse primer: STN51 60R fSIN nts 5 160-5 140) : (SEQ. ID NO. 28) 
5-CTGTAGATGGTGACGGTGTCG 

The second fragment was generated in a PCR reaction with the following primer pair: 

10 Forward pnmer: *079F (SIN ms 5079-5100) : (SEQ. ID NO. 29) 
5'-GAAGTGCCAGAACAGCCTACCG 

Reverse primer: SIN7643R fbuffer sequence/^ 1 site/ SIN nts 7643-7621) : 
(SEQ. ID NO. 30) 

1 5 5'-TATATCTCGAGGGTGGTGTTGTAGTATTAGTCAG 



The two PCR reactions were performed with the primer pairs shown 
above using the Thermalase (Amresco, Solon. OH), Vent (New England Biolabs. 
Beverly, MA) or KlenTaq thermostable DNA polymerases. Additionally, the reactions 
20 contained 5% DMSO and "HOT START WAX" beads (Perkin-Eimer, Foster City, 
CA). The PCR amplification protocol shown below was used. The extension period 
was 5 minutes or 2.5 minutes for reactions with the SP6-1F/SIN5160R or 
5079F/SIN7643R primer pairs, respectively. 
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Temperature (°C) 


Time (Min) 


No. Cycles 


95 


2 


1 


95 


0.5 




55 


0.5 


35 


72 


5.0 or 2.5 




72 


10 


10 



Following PCR, the two amplified products of 5142 bp {SP6- 
1F/SIN5160R pnmer pair) and 2532 bp (5079F/SIN7643R primer pair) were punned 
(PCR purification kit. Qiagen. Chatsworth. CA), and digested with Apa I and Sfl I (5142 
5 bp amplicon product) or Sfi I and XJio I (2532 bp amplicon product). The digested 
products were purified with GENECLEAN II (Bio 101, Vista, CA) and ligated together 
with pKS^ plasmid (Stratagene, La Jolla. CA) prepared by digestion with Apa I and 
Xho I and phosphatased with CIAP. This construction is known as pKSSIN-l-BV5\ 

The 3' region of the vector, comprised of the viral 3' end, a polyadenylate 
10 tract, and a unique restriction recognition sequence were inserted between the Not 1 and 
Sac I sites of the plasmid pKSSIN-l-BV5\ The 3' region of the vector was amplified by 
PCR from the pRSIN- 1 g plasmid in a reaction containing the following primer pair: 

Forward primer: SIN11386F (buffer sequence/ 'Not 1 site/SfN nts 1 1386-1 1407) : 
15 (SEQ. ID NO. 31) 

5'-TATATATATATGCGGCCGCCGCTACGCCCCAATGATCCGAC 

Reverse pnmer: SIN1 1 703R (buffer sequence/Sac I and Pme \ sites/T40/STN nts 1 1 703- 
20 11698) : (SEQ. ID NO. 32) 

5'-CTATAGAGCT CGTTTAAACT TTTTTTTTTT TTTTTTTTTT 
TTTTTTTTTT TTTTTTTTTG AAATG 
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In addition to the primer pairs shown above, the PCR reaction contained 
Thermalase, Vent or KlenTaq thermostable DNA polymerase, 5% DMSO, and "Hot 
Start Wax" beads (Perkin-Elmer). The amplification protocol was as shown above, but 
with a 72°C extension period of 30 seconds. 
5 The 377 bp amplified product corresponding to the 3' vector end was 

purified, digested with Not I and Sac I. purified, and ligated into pKSSFN-l-BV5', 
which was prepared by digestion with Not I and Sac I and treatment with CIAP. This 
plasmid is known as pKSSIN- 1 -B V. 

Using techniques described above, the lacZ gene encoding the p- 

10 gaiactosidase reporter protein was liberated from the plasmid pSV-p-galactosidase 
(Promega Corp., Madison, WT) with Bam HI and Hind III. and inserted into pKS+ at the 
corresponding enzyme recognition sites. The iacZ gene was digested from this plasmid, 
pKS-p-gal, with XJio I and Not L and inserted into pKSSIN-l-BV, between the Alio I 
and Not I sites. This plasmid is known as SINrep/SIN-1 nsPl-4/lacZ. 

15 Alternatively, the firefly luciferase gene encoding the luciferase reporter 

protein was liberated from the piasmid pT3/T7-LUC (Clontech. Palo Alto, CA) by 
digestion with Hind III, and inserted into pKS+ (Stratagene, La Jolla, CA) at the 
corresponding enzyme recognition sites contained in the multiple cloning sequence to 
generate pKS-luc. The luciferase gene was liberated from pKS-luc by digestion with 

20 XJio I and Not I and inserted into XJw VNot I digested pKSSIN-l-BV. This plasmid is 
known as SINrep/SIN-1 nsPl-4/luc. 

Additionally, the gene encoding the secreted form of alkaline 
phosphatase (SEAP) was inserted into pKSSIN-l-BV. Briefly, the SEAP gene was 
liberated from the plasmid pCMV/SEAP (Tropix, Bedford, MA) by digestion with Hind 

25 III and Xba I, and inserted in pSK+ (Stratagene, La Jolla, CA) at the corresponding 
recognition sites contained in the multiple cloning sequence, to generate pSK-SEAP. 
The SEAP gene was then liberated from pSK-SEAP by digestion with Xho I and Not I, 
and inserted into the corresponding enzyme recognition sites of pKSSIN-l-BV. This 
plasmid is known as SINrep/SIN-1 nsPl-4/SEAP. 
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The individual SIN-1 nsP genes were substituted into the corresponding 
wild-type virus region of the Sindbis virus-based lac Z replicon described previously 
(Bredenbeek et aL. J. Virol (57:6439-6446, 1993), in order to compare the expression 
properties of the SIN- 1 and wild-type expression vectors. Substitution of the SIN-1 nsP 
5 eenes into the Totol 101-derived lac Z replicon was accomplished as described in 
Example 1 . These vectors were designated as shown in the table below. 



Replicon Designation 
SINrep/lacZ 
SINrep/SIN-1 nsP2/lacZ 
SINrep/SIN-1 nsP3/lacZ 
SINrep/SIN-1 nsPl-4/lacZ 



nsP Genes Origin 
Totol 101 SIN-1 
nsP 1-4 

nsP 1,3-4 nsP2 
nsPl-2,4 nsP3 

nsP 1-4 



SP6 transcripts were prepared from the replicons shown in the table 
10 above, after linearization with Xlw I. as described in Example 1. RNA transcripts 
contained a 5' sequence that is capable of initiating transcription of Sindbis virus, 
Sindbis virus nonstructural protein genes 1-4, RNA sequences required for packaging, a 
Sindbis virus junction region, the lacZ gene, and the Sindbis virus 3' end proximal 
sequences required for synthesis of the minus strand RNA. 
15 The in vitro transcription reaction products or purified RNA were 

electroporated into baby hamster kidney-21 (BHK-21) cells as described previously 
(Liljestrom and Garoff, Bio/Technotogy 9:1356-1361, 1991). Alternatively, BHK-21 
cells were complexed with a commercial cationic lipid compound as described in 
Example 1 and applied to BHK-21 cells maintained at 75% confluency. Transfected 
20 cells were propagated in 35 mm dishes, and incubated at 37°C. 

The efficiency of transfection of BHK-21 cells with SINrep/lac Z RNAs 
after 9 hours was determined by two alternative methods. In the first method, 
transfected cells expressing p-galactosidase were determined by direct staining with X- 
gal (5-bromo-4-chloro-3-indolyl-b-D-galactopyranoside), after first fixing cells with 2% 
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cold methanol, as described previously (MacGregor et al., Cell Mol. Genei. 75:253-265, 
1987). In the second method, expression of p-galactosidase were determined by 
immunofluorescence, using a rabbit anti-p-gaiactosidase antibody. A portion of the 
cells transfected by either method described above were propagated on circular glass 

5 coverslips, contained in 35 mm dishes. At 9 hours post transfection (hpt), the media 
was removed by aspiration, and the cells were rinsed twice with PBS, and fixed with 
methanol by incubation overnight at -20°C. The methanol was removed by aspiration, 
the cells were rinsed three times with PBS, then incubated with 2% BSA (fraction V, 
Sigma. St. Louis. MO) for 30 minutes at room temperature. Following incubation with 

0 BSA to prevent non-specific antibody binding, the cells were incubated with the 
primary anti-p-galactosidase antibody (diluted 1:800 in 0.1% BSA/PBS) for 1 hour at 
room temperature. Excess primary antibody was then removed by aspiration and 
nnsing three times with 0.1% BSA in PBS. Following 1:100 dilution in 2% BSA in 
PBS, 100 ml of goat anti-rabbit-FITC conjugate secondary antibody (Sigma, St. Louis, 

5 MO) were added to the coverslips and incubated for 45 minutes at room temperature, in 
the dark. Excess secondary antibody was removed by rinsing the coverslips twice with 
0.1% BSA in PBS. and once with PBS. The coverslips were then mounted cell side 
down on a drop of Cytoseai 60 mounting media (Stephens Scientific, RJverdale, NJ), 
placed on a microscope slide. Fluorescence microscopy was used to determine the 

0 frequency of cells expressing p-galactostdase. in order to determine the transfection 
efficiency. 

The level of p-galactosidase in whole transfected cell lysates was 
determined at 9 hpt, by two alternative methods. In the first method, transfected cells 
were nnsed with PBS after aspiration of the media, and 250 p.1 of reporter lysis buffer 

5 (Promega, Madison, WT) per 10 6 cells was added to each dish, p-galactosidase 
expression levels were determined by mixing the supernatant fraction from cell lysates, 
processed by micro centrifugation at 14,000 r.p.m. for 1 minute at room temperature, 
with a commercially available substrate detection system (Lumi-gal, Clontech, Palo 
Alto, CA), followed by luminometry (Analytical Luminescence Laboratory, San Diego, 

0 CA). In the second method, the activity of p-galactosidase was determined as described 
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previously by Sambrook and Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory 
Press. NY). Briefly, transfected cells were rinsed with PBS after aspiration of the 
media, and 200 of TTE lysis buffer (10 mM Tris-HCl pH 8.0/1 mM EDTA/0.2% 
Triton X-100) per 10 A cells was added to each dish. Following pelleting of cell debris 

5 from the cell lysate by micro centrifugation at 14,000 r.p.m. for 1 minute at room 
temperature, 50 u-1 of the supernatant was added to 0.5 ml of Z buffer pH 7.0 (60 mM 
Na,HPO 4 /40 mM NaH.PO,/ 10 mM KC1/10 mM MgS04/50 mM 2-mercaptoethanol) 
and incubated at 37°C for 5 minutes, p-galactosidase activity in samples was then 
determined by spectrophotometry (420 nm) after addition of 0.2 ml of a solution 

10 containing 4 mg/ml of the chromogenic substrate o-nitrophenyl-b-D-galactopyranoside 
(ONPG; SIGMA. St. Louis. MO), in Z buffer. The samples were incubated at 37°C 
until the yellow color developed (approximately 5 minutes), and the reactions were 
terminated by addition of 0.5 ml of 0.5 M NaX0 3 . 

The level of p-galactosidase expression in transfected BHK-21 cells was 

15 determined according to the methods described above, and is illustrated in the table 
shown below. The data indicated are normalized for varying transfection efficiency, as 
described above. 



Replicon Transfected Expression of p-gal, Standard Deviation 

relative to SINrep/lacZ 

SINrep/iacZ 1.0 

SINrep/SIN-1 nsP2/lacZ 3.2 ±0.4 

SINrep/SIN-l nsP3/lacZ 0.2 ±0.0 

SINrep/SIN-1 nsPl-4/lacZ 5.2 +1.3 

20 The results demonstrate that the replicon vectors derived from the SIN-1 

variant strain are indeed functional. When transfected into BHK-21 cells, the level of 
expressed reporter protein are higher than that observed in wild type virus-derived 
vector transfected cells. Furthermore, as with the phenotype for establishment of 
productive persistent infection, the higher level of p-galactosidase expression in BHK- 
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21 cells transfected with the SIN- 1 -derived replicon vectors mapped primarily to the 
nsP 2 gene. 

EXAMPLE 4 

5 Preparation of SINI-Based DNA Vectors 

A. Construction of Plasmid DNA SIN-1 Derived Ex pression Vectors 

Efficient initiation of the Sindbis virus infectious cycle can occur in vivo 
from a genomic cDNA clone contained within an RNA polymerase II expression 

10 cassette (Dubensky et aL J. Virol 70:508-519, 1996.). The ability to express functional 
alphavirus genes from a DNA format has enabled two new alphavirus-based gene 
expression systems to be developed: (1) a plasmid DNA-based vector with applications 
for genetic immunization, and (2) the production of packaged alphavirus panicles in 
ceils co-transfected with vector replicon and DH plasmid DNAs. Previously, molecular 

15 approaches to produce infectious Sindbis virus RNA and its derived complementary 
vectors were restricted primarily to in vitro transcription of cDNA clones from a 
bactenophage RNA polymerase promoter followed by transfection into permissive 
cells. 

The plasmid DNA-based alphavirus derived expression vector is known 
20 as ELVS™ (Eukaryotic Layered Vector System). The ELVS™ plasmid DNA vector 
involves the conversion of a self-replicating vector RNA (replicon) into a layered DNA- 
based expression system. Within certain embodiments the first layer has a eukaryotic 
(e.g. RNA polymerase II) expression cassette that initiates transcription of a second 
layer, which corresponds to the RNA vector replicon. Following transport of the 
25 replicon expressed from the first layer from the nucleus to the cytoplasm, autocatalytic 
amplification of the vector proceeds according to the viral (e.g. alphavirus) replication 
cycle, resulting in expression of the heterologous gene. 

Construction of plasmid DNA expression vectors derived from SIN-1 
virus or those variants selected as taught in Example 2 were performed with 
30 modifications of methods previously described (Dubensky et al., /. Virol 70:508-519, 
1996). The expression vector was assembled on the plasmid vector pBGS131 (ATCC 
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No. 37443), which is a knamycin resistant analogue of pUC 9 (Spratt et al., Gene 
47:337-342. 1986). pBGS131 and its derived plasmids were propagated in LB medium 
containing 20 u-g/ml kanamycin. 

To facilitate insertion of heterologous sequences into the expression 
5 vector, the Xho I recognition sequence, located within the translational reading frame of 
the kanamycin gene in pBGSl31. was removed by inserting a partially complementary 
12-mer oligonucleotide pair that contained XJw I sticky ends. The Xiio 1 recognition 
site was lost as a result of the insertion, as shown below: 



10 Oligonucleotide 1: (SEQ. ID NO. 33) 
5'-TCGATCCTAGGA 



pBGS131 sequence after 
Original pBGS131 sequence Paired Oligonucleotides insertion of 12-mer 



SerArg ~~ SerlleLeuGlySerArg 

CTCGAGGC TCGATCCTAGGA CTCGATCCTAGGATCGAGGC 

GAGCTCCG AGGATCCTAGT GAGCTAGGATCCTAGCTCCG 



(A7io I site: CTCGAG) (SEQ. IDNOS. 33-36 and 104). 

15 

The oligonucleotide is gel annealed in equal molar concentrations in the 
presence of 10 rruM MgCl 2 , heated to 100°C for 5 min. cooled slowly to room 
temperature, and phosphorylated with polynucleotide kinase. The oligonucleotide was 
ligated at a 200:1 ratio of insertplasmid vector to pBGS131, which was prepared by 

20 Xiio I digestion and CIAP treatment. The resulting plasmid is called pBGS131 dXXho I. 
The growth rates of XLl-Blue (Stratagene) transformed with pBGS131 or pBGS131 
d\Xho I plasmids in LB medium containing 20 pg/ml kanamycin was indistinguishable 
over a time course between 1.5 and 8 hours. 

The bovine growth hormone (BGH) transcription 

25 termination/polyadenylation signal was inserted between the Sac \ and Eco RI sites of 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/2I062 



131 

pBGS131 dlYho \. The BGH transcription termination sequences were isolated by PCR 
amplification using the primer pair shown below and the pCDNA3 plasmid (Invitrogen, 
San Diego, CA) as template. 

5 Forward primer BGHTTF (buffer sequence/flic I site/pC DNA3 nts 1 132-1 161V 
(SEQ. ID NO. 37) 

5'-TATATATGAGCTCTAATAAAATGAGGAAATTGCATCGCATTGTC 

Reverse primer BGHTTR (buffer sequence//^ RT site/pCDNA3 nts 1180-1 154V 
10 (SEQ. ID NO. 38) 

5'-TATATGAATTCATAGAATGACACCTACTCAGACAATGCGATGC 

The pnmers shown above were used in a PCR reaction with a three 
temperature cycling program, using a 30 sec extension period. The 58 bp amplified 

15 product was purified with the PCR purification kit (Qiagen Chatsworth, CA), digested 
with Sac 1 and Eco RL purified with GENECLEAN II, and ligated into Sac XIEco RI 
digested. CIAP treated pBGS131 . The plasmid is known as pBGS131 dVOio I-BGHTT. 

The 3' end of the Sindbis virus-derived plasmid DNA expression vector 
was then inserted into the pBGS131 dlA7io I-BGHTT construct. This region of the 

20 vector contains the following ordered elements: Sindbis virus 3' end non-translated 
region (3' NTR); a 40-mer poly(A) sequence; the hepatitis delta virus (HDV) 
antigenomic ribozyme; and a Sac I recognition sequence. Construction of these ordered 
elements was accomplished by nested PCR, using the primers shown below and the 
pKSSrN-l-BV plasmid (Example 4) as template. 

25 

Forward primer: SIN1 1386F fbuffer sequence/Afo/ 1 site/SIN nts 1 1386-1 1407V 
(SEQ. ID NO. 31) 

5-TATATATATATGCGGCCGCCGCTACGCCCCAATGATCCGAC 
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Nested primer: pAHDVI F (polvr AVHDV RBZ nts 1 -46V (SEQ. ID NO. 39) 
5'-AAAAAAAAAA GGGTCGGCAT GGCATCTCCA CCTCCTCGCG 
GTCCGACCTG GGCATC 



5 Reverse primer: SacHDV77R (buffer sequenc e/.^ I site/HDV RBZ nts 77-27): 
(SEQ. ID NO. 40) 

S'-TATATGAGCTCCTCCCTTAGCCATCCGAGTGGACGTGCGTCCTCCTT 
CGGATGCCCAGGTCGGACCGCG 



10 The primers shown above were used in a PCR amplification according to 

the reaction conditions and three temperature cycling program described in Example 4, 
with an extension time of 30 sec. The 422 bp amplified product was purified with a 
PCR purification kit fQiagen Chatsworth. CA), digested with Not I and Sac I, punfied 
with GENECLEAN IL and ligated into Not USac I digested, CIAP-treated pKSSIN-1- 

15 BV. This construct is known as pKSSIN-1 BV/HDVRBZ and contains Sindbis virus- 
derived plasmid DNA expression vector sequences from the Bgl II site at Sindbis nt 
2289 extending through the 3' end of the vector including the HDV ribozyme sequence. 

Plasmid pKSSIN-1 BV/HDVRBZ was then digested with Bgl II and 
Sac I, the 5S15 bp fragment was isolated by 1% agarose/TBE gel electrophoresis, 

20 punfied with GENECLEAN II, arid was inserted into Bgl IVSac I digested. CIAP- 
treated pBGS131 dWio I-BGHTT to generate the plasmid construct known as 
pBG/SIN-lBglLF. This construct contains the region of the Sindbis virus expression 
vector from plasmid pKSSIN-1 BV/HDVRBZ described above with the 3' end fused to 
the BGH transcription termination sequence on the pBGS131 dlA770 I plasmid. 

25 Assembly of the Sindbis virus plasmid DNA vector was completed by 

insertion of the CMV promoter juxtaposed with the first 2289 nts of the Sindbis virus 
genome (includes the 5' viral end and a portion of the nsPs genes) into the pBG/SIN- 
IBglLF plasmid. Using an overlapping PCR approach, the CMV promoter was 
positioned at the 5' viral end such that transcription initiation results in the addition of a 

30 single non-viral nucleotide at the 5' end of the Sindbis virus vector replicon RNA. The 
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CMV promoter was amplified in a first PCR reaction from pCDNA3 (Invitrogen, San 
Diego, CA) using the following primer pair: 

Fonvarri primer: pCfl?/233F (buffer sequen ce/Bgl II recognition sequcncc/CMV 
5 promoter ms 1-221 : (SEQ. ID NO. 4 1 ) 

5'-TATATATAGATCTTTGACATTGATTATTGACTAG 

Reverse primer: SNCMV1142R fSIN nts S-l/CMV pro ms 11 42-11 08) : 
{SEQ. ID NO. 42) 

1 0 5*-CCGTCAATACGGTTCACTAAACGAGCTCTGCTTATATAGACC 

The primers shown above were used in a PCR reaction according to the 
reaction conditions and three temperature cycling program described in Example 4, with 
an extension time of 1 min. 
15 The SIN-1 5' end was amplified in a second PCR reaction from 

pKSRSIN-lg clone (Example 1) using the following primer pair: 

Forward primer: CMVSIN1 F fCMV pro ms 1 1 24-1 142/STN nts 1-20) : 
(SEQ. ID NO. 43) 

20 

5'-GCTCGTTTAGTGAACCGTATTGACGGCGTAGTACACAC 



Reverse primer: STN 31 82R (SIN nts 3182-3160) : (SEQ. ID NO. 44) 

25 

5'-CTGGCAACCGGTAAGTACGATAC 

The primers shown above were used in a PCR reaction with a three 
temperature cycling program using a 3 min extension period. 
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The 930 bp and 3200 bp amplified products were purified with a PCR 
purification kit (Qiagen) and used together in a PCR reaction with the following primer 
pair: 

5 Forward primer: pCgg/233F : (SEQ. ID NO. 41) 

S'-TATATATAGATCTTTGACATTGATTATTGACTAG 
Reverse primer: ( SIN nts 2300-2278) : (SEQ. ID NO. 45) 

10 

5'-GGTAACAAGATCTCGTGCCGTG 

The primers shown above were used in a PCR reaction with a three 
temperature cycling program using a 3,5 min extension period. 

15 The 26 3' terminal bases of the first PCR amplified product overlap with 

the 26 5' terminal bases of the second PCR amplified product; the resultant 3200 bp 
overlapping secondary PCR amplified product was purified by 1% agarose/TBE 
electrophoresis, digested with Bgl II, and ligated into Bgl II digested, CIAP-treated 
pBG/SIN-lBglLF. This construct is called pBG/SIN-1 ELVS 1.5. 

20 As discussed within Example 1, relatively few nucleotide point changes 

in the nsP gene sequence of wild-type Sindbis virus result in the phenotype 
characteristic of SIN-1 . No new restriction enzyme recognition sites are generated as a 
result of these nucleotide changes which facilitate clones derived from wild-type and 
SIN-1 genotypes to be easily distinguished. A PCR-based diagnostic assay was 

25 therefore devised as a rapid method for identification of SIN-1 derived clones. Briefly, 
forward primers were designed so that a particular base change between SIN-1 and 
wild-type was positioned at the 3' terminal base of the primer. One primer contained 
the SIN-1 nucleotide while another contained the wild-type nucleotide. A reverse 
primer in a region downstream conserved between both genotypes was used in 

30 combination with each forward primer. At the correct annealing temperature, SIN-I 
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templates were only amplified in reactions containing SIN-1 forward primers. The 
pnmer sequences used to distinguish wild-type and SIN-1 genotypes is given below. 
The reaction conditions were as described throughout the examples contained herein. 

5 Primer Set I : 

Forward primers: 

WT 1 OOF S'-GTC CGT TTG TCG TGC AAC TGC 

(SEQ. ID NO. 105) 

1 0 SIN- 1 1 OOF: 5'-GTC CGT TTG TCG TGC AAC TGA 

(SEQ. ID NO. 106) 

Peverse primer: 
SIN2300R 

15 

PCR Program: (95°C-30", 72°C-2') 20 cycles 

Pnmer Set 2 : 

Forward primers: 

20 WT3524F 5'-CAA TCT TCC TCA CGC CTT AGC 

(SEQ. ID NO. 107) 

SIN-1 3524F 5'-CAA TCT TCC TCA CGC CTT AGT 
(SEQ. ID NO. 108) 

25 

Reverse primer: 
SIN5448R 

PCR Program: (95°C-30'\ 60°C-30", 72°C-2') 20 cycles 

30 
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Primer Set 3 : 

Forward primers: 

WT7592F 5'TCC TAA ATA GTC AGC ATA GTA 

(SEQ. ID NO. 109) 

5 

SIN-17592F 5TCC TAA ATA GTC AGC ATA GTT 
(SEQ. ID NO. 110) 

Reverse primer: 

10 SIN7643R 5*-TATATCTCGAGGGTGGTGTTGTAGTATTAGTCAG 

(SEQ. ID NO. Ill) 

PCR Program: (95°C-30 M . 60°C-30", 72°C~2') 20 cycles 

15 Reponer protein expression vectors were constructed by inserting the 

lacZ, SEAP, or luciferase reponer genes into the pBG/SIN-1 ELVS 1.5 vector 
backbone. In separate reactions, the pKS-P-gal, pSK-SEAP, and pKS-luc plasmids 
(Example 4), were digested with XJio I and Not I The fragments containing the lacZ, 
SEAP, or luciferase genes were isolated by 1% agarose/TBE gel electrophoresis and 

20 purified subsequently with GENECLEAN II. These reponer genes were then ligated in 
separate reactions with XlioVNot I digested. CIAP-treated pBG/SIN-1 ELVS 1.5 
plasmid. These constructs are known as pBG/SIN-1 ELVS 1.5-p-gai, pBG/SIN-1 
ELVS 1.5-SEAP, andpBG/SIN-1 ELVS 1.5-luc. 

25 B. Expression of Heterologous Proteins in Cells Transfecte d with pBG/SIN-1 
ELVS 1.5-SEAP. pBG/SIN-1 ELVS 1.5-luc or nBG/SIN-1 ELVS 1.5-B-eal 
Ex pression Vectors 

The pattern of secreted alkaline phosphatase, luciferase, and p- 
galactosidase reponer gene expression in BHK cells transfected with pBG/SIN-1 ELVS 
30 1.5 or pBG/wt ELVS 1.5 vectors was compared. The pBG/wt ELVS 1.5 plasmid 
contains sequences derived from wild-type Sindbis virus, rather than the SIN-1 variant. 
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Construction of the pBG/wt ELVS 1.5 expression vectors was exactly as described 
herein for the pBG/SIN-1 ELVS 1.5 expression vectors, except that full-length genomic 
cDNA derived from wild-type Sindbis virus (Dubensky et al., WO 95/07994) was used 
as the template for the vector construction. Construction of the pBG/wt ELVS 1.5 
5 expression vector has been described previously (Dubensky, supra.); thus, although the 
strains, and therefore the sequences, are different, the Sindbis virus-specific regions 
contained in the pBG/wt ELVS 1.5 and pBG/SIN-1 ELVS 1.5 expression vectors are 
the same. 

Baby hamster kidney-21 (BHK-21) cells maintained at 75% confluency 

10 in 12 mm dishes were transfected with 1.0 ug of pBG/SIN-1 ELVS 1.5 or pBG/wt 
ELVS 1.5 expression vector plasmid DNAs complexed with 4.0 ul of a commercially 
available lipid (Lipofectamine, GIBCO-BRL). Otherwise, transfection conditions were 
as suggested by the lipid manufacturer. Eagle minimal essential medium supplemented 
with 5% fetal bovine sera was added to the ceils at 4 hours post transfection (hpt), 

15 unless otherwise indicated. Transfected cells were incubated at 3 7°C. At various times 
post transfection, as indicated below, several assays were performed to compare vector- 
specific RNA synthesis, and expression of secreted alkaline phosphatase, luciferase, or 
p-galactosidase reporter gene expression in cells transfected with pBG/SEN-l ELVS 1.5 
orpBG/wt ELVS 1.5 plasmid DNAs. 

20 The levels of alkaline phosphatase secreted into the culture medium of 

BHK cells transfected with pBG/SrN-1 ELVS-1 L5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmid DNA were compared. Cell culture medium was assayed for the presence of 
alkaline phosphatase with the Phospha-Light™ chemi luminescent reporter gene assay, 
according to the directions of the manufacturer (Tropix, Inc., Bedford, MA). Briefly, 

25 lOul of cell culture supernatant was mixed with 30ul of Dilution Buffer and incubated 
for 30 minutes at 65°C. The sample was allowed to cool to room temperature before 
mixing with 40ul of Assay Buffer. The sample was incubated for five minutes at room 
temperature followed by the addition of 40u,l of Reaction Buffer. Samples were 
incubated for 20 minutes at room temperature. Total luminescence was measured on an 

30 ML3000 microtiter plate luminometer (Dynatech, Inc., Chantilly, VA) in cycle mode. 
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and alkaline phosphatase (AP) in the culture medium of BHK cells transfected with 
pBG/SIN-1 ELVS-I 1.5-SEAP or pBG/wt ELVS 1.5-SEAP plasmid DNAs, were 
determined at 48 hpt, and the results are shown in the table below. 

Plasmid Transfected RLU at 48 hpt 

pBG/SIN-1 ELVS 1.5-SEAP 18 ± 1.7 

pBG/wtELVS 1.5-SEAP 94 ±10.7 

pCDNA3 0.13 ±0.04 

5 

Additionally, the levels of vector-specific RNAs synthesized in BHK-21 
cells transfected with pBG/SrN-1 ELVS 1.5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmids were determined by Northern blot analysis, exactly as described previously 
(Dubensky, supra.), at 48 hours post-transfection. The results of this experiment are 

10 shown in Figure 9A. Total cellular RNA was isolated from transfected BHK cells with 
Tri-Reagent as described by the manufacturer (Molecular Research Center, Inc., 
Cincinnati, OH). Total cellular RNA concentrations present in samples from 
transfected BHK cells were determined spectrophotometrically. Additionally, material 
isolated from transfected cells was determined to be intact by electrophoresis of 0.5 ug 

15 of total cellular RNA through 0.7% agarose/TBE mini gels, stained 10 ul/ml of 
ethidium bromide. Northern blot analysis was performed according to Sambrook and 
Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). In 
order that RNA from all transfected samples could be visualized on a autoradiogram 
from a single Northern blot analysis, 2.5 ug and 30 ug of RNA were loaded per lane 

20 from pBG/wt ELVS 1.5-SEAP and pBG/SIN-1 ELVS 1.5-SEAP transfected cells, 
respectively. Four samples of RNA, from individual transfections with both plasmids 
tested, were electrophoresed through 0.7% formaldehyde agarose gels and transferred to 
Zeta-probe membrane (Bio-Rad, Richmond, CA). The blot was hybridized with 
random-primed probes corresponding to the alkaline phosphatase gene. The results of 

25 this experiment in which the levels of vector-specific RNA synthesis and AP expression 
in transfected BHK cells were compared at 48 hpt, demonstrate that while the level of 
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vector-specific RNA synthesized in cells transfected with pBG/SIN-1 ELVS 1.5-SEAP 
DNA was at least 100-fold lower than in pBG/wt ELVS 1.5-SEAP transfected cells, the 
levels of AP were only 5-fold lower in cells transfected with pBG/SIN-1 ELVS 1.5- 
SEAP DNA, compared to pBG/wt ELVS 1.5-SEAP DNA. 
5 The levels of alkaline phosphatase secreted into the culture medium of 

BHK cells transfected with pBG/STN-1 ELVS-1 1.5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmid DNA were also compared over a 7 day time-course. The results of this study, 
illustrated in Figure 9B, demonstrate that the levels of AP present in the culture medium 
at early time points were much lower in cells transfected with pBG/SIN-l ELVS-1 1.5- 

10 SEAP plasmid. compared to pBG/wt ELVS 1.5-SEAP plasmid. However, the level of 
AP expressed in cells transfected with the SIN-1 Sindbis virus variant strain-derived 
vectors rapidly increased and was higher than in cells transfected with wild-type virus- 
derived vectors by the 96 hpt time point. 

The luciferase levels present in BHK cells transfected with pBG/SIN-1 

15 ELVS-1 1.5-luc or pBG/wt ELVS 1.5-luc plasmid DNA were compared at 24, 48, and 
72 hpt. The luciferase expression levels were quantitated by adding 250 nl of reporter 
lysis buffer (Promega, Madison WI) per 10 6 transfected cells, centrifuging the lysate at 
14,000 rpm for 1 min, and then mixing the supernatant fraction from the cell lysates 
with a commercially available substrate detection system (Promega, Madison WI), 

20 followed by luminometry (Analytical Luminescence Laboratory, San Diego, CA). The 
results from this experiment (shown in the table below), and shown graphically in 
Figure 10, parallel the results observed with the alkaline phosphatase expression 
vectors. At early times post transfection the luciferase expression levels were lower in 
BHK cells transfected with pBG/SIN-1 ELVS 1.5-luc plasmid, compared to pBG/wt 

25 ELVS 1.5-luc plasmid. However, at the 48 and 72 hpt time points, the luciferase levels 
were similar in BHK cells transfected with Sindbis virus SIN-1 variant strain- and wild- 
type-derived expression vectors. 
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Plasmid Transfected 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wt ELVS 1.5-luc 



Hr. Post 
Trans- 
fee ti on 



24 



Relative Light Units 
(Ave. ± SD) 



1.2 x 10 V 
2.1 x 10 s 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wt ELVS 1.5-luc 



48 



3.3 x 10 9 
4.1 x 10* 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wt ELVS 1.5-luc 



72 



1.8 x 10" 
5.8 x 10 9 



pCDNA3 



48 



482 



The efficiency of transfection of the pBG/SIN-1 ELVS 1.5-P-gal and 
pBGAvt ELVS 1.5-p-gal plasmids in BHK cells at 48 hpt was determined by direct X- 
gal (5-bromo-4-chloro-3-indolyl-b-D-galactopyranoside) staining of the cell monolayer 
5 (MacGregor, Cell Mol Genet, 75:253-265, 1987), in order to measure directly the 
number of cells expressing p-galactosidase. The transfection efficiencies were 
equivalent, and are shown in the table below. 

Plasmid Transfected No. Blue Cells/lOOX Field 

pBG/SIN-1 ELVS 1.5-P-gal 24 + 3 

pBG/wtELVS 1.5-p-gal 26 ± 7 



10 The levels of vector-specific RNAs synthesized in BHK-21 cells 

transfected with pBG/SIN-1 ELVS 1.5-p-gal or pBG/wt ELVS 1.5-p-gal plasmids 
were determined by Northern blot analysis, exactly as described previously (Dubensky, 
supra.), at 48 and 72 hours post-transfection. Total cellular RNA was isolated from 
transfected BHK cells with Tri-Reagent as described by the manufacturer (Molecular 

15 Research Center, Inc., Cincinnati, OH). Total cellular RNA concentrations present in 
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samples from BHK cells transfected with pBG/SIN-1 ELVS 1.5-p-gal or pBG/wt 
ELVS 1 .5-p— gal plasmids were determined spectrophotometrically. Additionally, 
material isolated from transfected cells was determined to be intact by electrophoresis 
of 0.5 ug of total cellular RNA through 0.7% agarose/TBE mini gels, stained 10 ul/ml 
5 of ethidium bromide. Northern blot analysis was performed according to Sambrook and 
Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). 
In order that RNA from all transfected samples could be visualized on a autoradiogram 
from a single Northern blot analysis, 2.5 ug and 5 ug of RNA were loaded per lane 
from pBG/wt ELVS 1.5-P-gal and pBG/SIN-1 ELVS 1.5-P-gal transfected cells, 

10 respectively. Two samples of RNA. from individual transfections with both plasmids 
tested, at 48 and 72 hpt, were electrophoresed through 0.7% formaldehyde agarose gels 
and transferred to Zeta-probe membrane (Bio-Rad, Richmond, CA). The blot was 
hybridized with random-primed probes corresponding to the p-galactosidase gene. The 
results of this experiment, shown in Figure 11 A, demonstrate that the level of vector 

15 specific RNA synthesized in cells transfected with pBG/SIN-1 ELVS 1.5-P-gal DNA at 
48 and 72 hours post transfection was at least 100- fold lower than the level of RNA 
detected in pBG/wt ELVS 1.5-p-gal transfected cells. 

Additionally the p-galactosidase expression levels were quantitated in 
transfected whole cell iysates by adding 250 ul of reporter lysis buffer (Promega, 

20 Madison WI) per 10 6 transfected cells, centrifuging the lysate at 14,000 rpm for 1 min, 
and then mixing the supernatant fraction from the cell Iysates with a commercially 
available substrate detection system (Clontech. Palo Alto, CA), followed by 
luminometry (Analytical Luminescence Laboratory, San Diego, CA). The results from 
this experiment (shown in the table below, and graphically in Figure 9C) demonstrate 

25 that at early times post transfection the p-galactosidase expression levels were 
significantly lower in BHK cells transfected with pBG/SIN-1 ELVS 1,5-P-gal plasrmU 
compared to pBG/wt ELVS 1.5-p-gal plasmid. However, reporter expression rapidly 
increased over the time-course in pBG/SIN-1 ELVS 1.5-p-gal transfected cells such 
that the p-galactosidase levels were higher than in wild-type virus transfected cells at 

30 the final 1 20 hr time point. 
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Plasmid Transfected 



Hr. Post 
Trans- 
fection 



Relative Light Units 
(Ave, t SD) 



pBG/SIN-1 ELVS 1.5-P-gai 
pBG/wt ELVS 1.5-P-gal 



24 



54030 ± 7348 
4801590 + 74,425 



pBG/SIN-1 ELVS 1.5-P-gal 
pBGAvt ELVS 1.5-P-gal 



48 



310830± 31083 
2214921 + 248071 



pBG/SIN-I ELVS 1.5-P-gal 
pBGAvt ELVS 1.5-P-gai 



72 



1443474 ±98156 
3793524 + 857336 



pBG/SIN-1 ELVS 1.5-P-gal 
pBGAvt ELVS 1.5-p-gal 



96 



2232585 + 299166 
3514262 + 548225 



pBG/SrN-l ELVS 1.5-P-gal 
pBGAvt ELVS 1.5-P-gal 



120 



3200910+ 128036 
1986537 + 166869 



pCDNA3 



3637 



Expression of p-galactosidase in cells transfected with the pBG/SIN-1 
ELVS-1 1.5-P-gal or pBGAvt ELVS 1.5-p-gaI plasmid DNAs was also measured 
5 directly by Western blot analysis using a monoclonal antibody specific for the reporter 
protein (Boehringer Mannheim), at the final 120 hpt time point. In parallel with the 
reporter protein activity determined at 120 hpt, the level of p-galactosidase protein was 
at the same level, or greater, in PBG/SIN-1 ELVS-l 1.5-P-gal transfected BHK cells, 
compared to pBGAvt ELVS 1.5-p-gal, and is demonstrated in Figure 1 1C, 
10 Taken together, the results described herein demonstrate that the level of 

vector-specific RNA synthesized is at least 100-fold less in pBG/SIN-1 ELVS 
transfected cells, compared to pBGAvt ELVS transfected cells. Importantly however, 
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after a 48-72 hour lag, the levels of reporter gene expression are equivalent, or higher, 
in pBG/SIN-l ELVS transfected ceils, compared to pBG/wt ELVS transfected cells. 
The phenotype of pBG/SEN-l ELVS, characterized by high expression levels combined 
with low vector-specific RNA synthesis in transfected cells, is due likely to the 
5 diminished, or absent, inhibition of host cell protein synthesis. This property of 
pBG/SIN-I ELVS thus results in much higher levels of expressed reporter protein per 
subgenomic mRNA translation template in transfected cells, compared to pBG/wt 
ELVS. In summary, the phenotype of the plasmid DNA expression vectors derived 
from the SIN-1 variant strains follows the parent virus, in terms of equivalent 
10 expression levels, combined with relatively low levels of RNA synthesis, compared to 
wild-type virus derived-vectors. As vectors do not contain any of the Sindbis virus 
structural proteins, this phenotype must map to the nonstructural genes of the SIN-1 
virus variant. 



15 

EXAMPLE 5 

MODIFICATIONS OF PLASMID DNA SIN- 1 DERIVED EXPRESSION VECTORS 



Expression levels of heterologous genes in target cells from alphavirus- 
20 based vectors are affected by several factors, including host genus and vector 
configuration. For example, p-galactosidase expression levels are 10- to 100-fold 
higher in BHK cells, compared to some human cells, such as HT1080, transfected with 
pBG/ELVS vectors. The levels of reporter gene expression in BHK and several human 
cell lines transfected with pBG/wt ELVS 1.5-pga! plasmid DNA (see example 4) were 
25 compared in order to establish the relative level of vector-specific expression in cell 
types derived from the intended in vivo target genus. The levels of p-galactosidase 
expression in BHK ceils and HT1080 (ATCC CCL 121) cells, a human fibrosarcoma 
line, transfected with pBG/wt ELVS 1.5-pgal plasmid, or with a conventional plasmid 
expression vector, were determined. The conventional plasmid vector was constructed 
30 by insertion of the lac Z gene (Promega, Madison, WI) into the CMV promoter-driven 
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pUC-derived expression plasmid multiple cloning site (Invitrogen, San Diego, CA), and 
is known as pCMV-P-gal The results of this study, given in Figure 12 A. demonstrate 
that the level of p-galactosidase expression was nearly 100-fold lower in HT1080 cells 
transfected with pBG/wt ELVS 1.5-PgaI plasmid DNA, compared to BHK cells. Cells 
5 were also transfected with pCMV-P-gal in order to segregate RNA polymerase II 
expression from Sindbis virus vector replicon expression. In this experiment, while the 
expression decreased 5- to 10-fold in HTI080 cells transfected with pCMV-p-gai 
plasmid. compared to BHK ceils, expression decreased nearly 100-fold in HT 1080 
cells transfected with pBG/wt ELVS 1.5-Pgal plasmid DNA. Thus, the results indicate 

10 that the dramatic decrease of reporter gene expression in HT1080 cells transfected with 
pBG/wt ELVS 1.5-pgal plasmid DNA is due in pan to the diminished activity of the 
Sindbis virus vector replicon in these human cells. 

Given the overall plasticity of the RNA alphaviral genome and the 
propagation of virus in BHK cells, it is not surprising that the expression levels of 

15 heterologous genes are highest in the host cell lines from which the vectors were 
derived. Thus, selection of alphaviruses with the SIN- 1 phenotype (as described in 
Examples 1 and 2). characterized by comparatively low viral RNA levels and equivalent 
virus production levels, combined with delayed or absent inhibition of host cell protein 
synthesis, can be performed in any human primary, or diploid or polyploid human cells. 

20 In addition to selecting alphaviruses with desired phenotypes in cells 

(e.g., human) which more closely parallel target cells in vivo, several alternative 
modifications of the prototype plasmid DNA expression cassette components can also 
be performed. For example, substitution of the MoMLV RNA polymerase II promoter 
with the stronger CMV immediate early (IE) promoter significantly enhances the level 

25 of heterologous gene expression in transfected cells (Dubensky et al., J. Virol. 70:508- 
519, 1996, and Dubensky et al., W/O 95/07994). Further, juxtaposition of introns, for 
example SV40 small t antigen or CMV intron A, either upstream or downstream from 
the heterologous gene, can increase the level of heterologous gene expression in some 
transfected cell types (Dubensky et al., supra.). 
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Several further alternative modifications of the prototype plasmid DNA 
expression cassette components can also be utilized in order to enhance the overall 
expression in transfected cells in vitro or in vivo. In one modification, the Hepatitis B 
virus (HBV) posttranscriptional regulatory element (PRE) was inserted in the pBG/wt 
5 ELVS 1.5-pgal plasmid DNA. The PRE sequence activates the transport of HBV S 
transcripts in cis from the nucleus to the cytoplasm. The PRE sequence appears to 
function independently of splice donor and acceptor sites, and has been shown to 
activate cytoplasmic expression of a (3-globin transcript not containing introns. It has 
been proposed that the PRE functions in cis to allow the export of nuclear transcripts 
10 that do not interact efficiently with the splicing pathway and hence are not exported 
well from the nucleus (Huang et al.. Molecular and Cellular Biology 75:3864-3869, 
1995). 

The PRE sequence was cloned into pBG/SrN-1 ELVS 1.5-pgal by 
isolating first a PCR-generated 564 bp fragment of HBV from the full length genomic 
15 clone of the ADW viral strain, pAM6 (ATCC No. 39630). The amplified fragment 
extends from base 1238-1802 of the HBV genome. The primer sequences are given 
below. 

Forward Pnmer- VTPRE1238F (SEQ. ID NO. 46) 
20 S'-CCTATGCGGCCGCGTGGAACCTTTGTGGCTCCTC 

Reverse Primer: FAPRE1 802R (SEQ. ID NO. 47) 
5'-CCTATTGGCCAGCAGACCAATTTATGCCTAC 

25 

The primers introduce a Not I recognition site at the 5' end of the 
fragment and an Eae I recognition site at the 3' end. Not I and Eae I have compatible 
sticky ends. The Eae I recognition site is internal to the Not I site, so Eae I cuts both 
30 sites. 
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The PCR fragment was digested with Eae I and cloned into Not I 
digested/CIAP treated pBG/wt ELVS 1.5-pgal. The correct clone retains the Not I site 
at the 5' terminus of the PRE and is called pBG/wt ELVS/PRE 1.5-Pgai. 

The possible effect of the PRE sequence contained in ELVS plasmids on 
5 heterologous gene expression in transfected cells was determined. Briefly, BHK and 
HT1080 cells were cultured in 12 mm dishes to 75% confluency and transfected with 
500 ng of pCMV-P-gal, pBGAvt ELVS 1.5-Pgal, or pBG/wt ELVS/PRE 1.5-Pgal 
plasmid DNA compiexed with Lipofectamine (GIBCO-BRL, Gaithersburg, MD), and 
the level of p-galalactosidase expression was determined 48 hr later. Transfection 
10 efficiencies were determined by direct Xgal staining of transfected monolayers, as 
described in Example 4, and are shown in the table below. 

No. Blue cells/12 mm Dish 



Construct BHK HXLQ&Q 

pCMV-B-gal 549 42 

pBG/wt ELVS 1.5-pgal 146 1 

pBG/wt ELVS/PRE l.5-Pgal 334 33 

Mock 0 0 



The results demonstrate clearly that the number of BHK or HT1080 ceils transfected 
with ELVS plasmids expressing p-galalactosidase was increased dramatically by 

15 inclusion of the RNA transport PRE sequence in the vector. Further, these results 
indicate that one cause for the diminished heterologous gene expression levels in 
HT1080 cells, compared to BHK cells, transfected with ELVS plasmid DNA is the 
inefficient transport of the primary transcript from the nucleus. 

In parallel with the higher frequency of reporter protein expressing 

20 HT1080 cells transfected with ELVS plasmids containing the PRE sequence, the levels 
of p-galalactosidase were dramatically higher in lysates from HT1080 cells transfected 
with ELVS vectors containing the PRE sequence. These results are illustrated in 
Figure 12B, and taken together with the results shown in the table above, demonstrate 
that functional vector replicons are transported inefficiently from the nucleus in human 

25 cells transfected with ELVS plasmids. Further, inclusion of the PRE sequence in the 
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ELVS plasmid construct increases the level of heterologous gene expression in all cells 
tested, demonstrating a clear relationship between efficiency of cytoplasmic vector 
replicon transport and overall heterologous gene expression level. 

Several other viral sequence elements which operate in cis to transport 
5 unspliced RNAs have also been identified. For example, a 219 bp sequence, located 
between nts 8022 and 8240 near the 3' end of the Mason-Pfizer monkey virus (MPMV) 
genome, has been shown to enable Rev independent human immunodeficiency virus 
type 1 (HIV-1 ) replication (Bray et aL PNAS 97:1256-1260. 1994). The MPMV RNA 
transport element known as the constitutive transport element (CTE), is inserted into 

10 the pBG/SfN-1 ELVS 1.5-pgaI plasmid by first isolating a PCR-generated 219 bp 
fragment of MPSV from the full length genomic clone template (Sonigo et al.. Cell 
45:375-385. 1986). or the MPSV subgenornic clone pGEM7FZ(-)MPSV 8007-8240 (D. 
Rekosh. Ham-Rek Laboratories, SUNY at Buffalo. 304 Foster Hall. Buffalo, New 
York). The amplified fragment extends from base 8022-8240 of the MPSV genome. 

1 5 The primer sequences are given below. 

Forward Primer: NMPVM8021F (SEQ. ID NO. 48) 

S'-CCTATGCGGCCGCTAGACTGGACAGCCAATGACG 

20 

Reverse Primer: FMPMV8241R (SEQ. ID NO. 49) 
S'-CCTATTGGCCAGCCAAGACATCATCCGGGCAG 

25 The primers introduce a Not I recognition site at the 5' end of the 

fragment and an Eae I recognition sire at the 3' end. Not I and Eae I have compatible 
sticky ends. The Eae I recognition site is internal to the Not I site, so Eae I cuts both 
sites. 
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The PCR fragment is digested with Eae I and cloned into Not I 
digested/CIAP treated pBG/wt ELVS 1.5-PgaI. The correct clone retains the Not I site 
at the 5* terminus of the PRE and is called pBG/wt ELVS/CTE 1.5-(3gaL 

In addition to the HBV PRE and MPSV CTE sequences, several RNA 

5 transport elements from other viral or cellular sources can be inserted into the ELVS 
plasmid constructs, as described above. For example, some of these elements include 
the HIV Rev responsive element (Malim et al., Nature. 338:254-251, 1989), the HTLV 
1 Rex element (Ahmed et. ah Genes Dev., 4:1014-1022, 1990), and another c/5-acting 
sequence from simian retrovirus type 1 (Zolotukhin et al, J. Virol., 65:7944-7952, 

0 1 994). In addition, each of the above RNA transport elements also may be incorporated 
into the structural protein expression casettes. packaging cell lines, or producer cell 
lines described in Examples 6 and 7. 

In yet another modification of prototype ELVS vectors, expression of the 
alphavirus replicons can be driven from an RNA polymerase I promoter. Briefly, 

5 because RNA polymerase I promoters are not tissue specific and are expressed in 
essentially all human cells in the body, they provide an attractive alternative for plasmid 
DNA-directed alphavirus replicon expression in transfected cells. For example, the 
human rDNA promoter (plasmid prKU3. Learned and Tjian. J. Mol. Appi Gen., 1:515- 
584, 1982). has been used to construct a vector for heterologous gene expression 

0 (Palmer et al., Nuc. Acids Res. 27:3451-3457, 1993). 

Thus, within one embodiment of the invention a RNA polymerase I 
promoter can be juxtaposed with the 5' end of the replicon cDNA such that the first 
nucleotide transcribed in transfected cells corresponds to the authentic alphavirus 5' end. 
Identification of the RNA polymerase I promoter {e.g., plasmid prHU3) nucleotide at 

5 which transcription initiation occurs is determined as described previously (Dubensky et 
al., W/O 95/07994). 

All modifications described herein can be performed with ELVS 
constructs containing the Sindbis virus wild-type or SIN-1 nsPs, or nsP genes from any 
alphavirus. For example, all of the constructions provided in Example 5 can also be 
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performed with plasmid pBG/SIN-1 ELVS 1.5-Pgal, whose construction is described in 
Example 4. 



EXA MPL E < ? 

5 Construction of alphavirus Packaging Cell Lines 

In the present invention, alphavirus packaging cell lines (PCL) are 
provided, whereby the virus-derived structural proteins necessary for RNA packaging 
and formation of recombinant alphavirus vector particles are encoded by one or more 

10 stably transformed structural protein expression cassette(s). Synthesis of these proteins 
preferably occurs in an inducible manner, and in particularly preferred embodiments, 
via transcription of subgenomic mRNA from their native "junction region" promoter. 
Inducible subgenomic transcription is mediated by the input alphavirus vector RNA 
itself (Figure 13). Following primary transcription from the structural protein 

15 expression cassette(s), cytoplasmic amplification of the RNA transcript is initiated by 
vector-encoded nonstructural proteins, and ultimately leads to transcription from the 
junction region promoter and high level structural protein expression. The structural 
protein expression casettes may include any of the previously described elements of the 
present invention, including RNA transport elements (e.g., HBV PRE and MPMV CTE) 

20 and splicing sequences. Such PCL and their stably transformed structural protein 
expression cassettes can be derived using methods described within PCT application 
WO 95/07994, or using novel approaches described within this invention. PCL may be 
derived from almost any existing parental cell type, including both mammalian and 
non-mammalian cells. Preferred embodiments for the derivation of PCL are cell lines 

25 of human origin. 

A. Construction of Vector-Inducible Alphavirus PCL 

For example, an alphavirus structural protein expression cassette was 
constructed, whereby primary transcription from a CMV immediate early promoter 
30 produces an RNA molecule capable of efficient cytoplasmic amplification and 
structural protein expression only after translation of nonstructural replicase proteins 
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from the vector RNA. Specifically, plasmid pDCMV-dlnsPSIN (Dubensky et al., J. 
Virol. 70:508-519. 1996), a DNA-based Sindbis defective helper (DH) vector, was 
modified to contain both a hepatitis delta virus (HDV) antigenomic ribozyme sequence 
(Perotta and Been. Nature 550:434-436, 1991) for 3'-end RNA processing, and an SV40 
5 small t antigen intron inserted within the region of nonstructural protein gene deletion. 
Due to restriction site duplications associated with insertion of the HDV ribozyme 
sequence, an additional plasmid from Dubensky et al. (ibid), pDLTRSINgHDV, was 
used as starting material to reconstruct the modified CMV-based DH construct. 
Plasmid pDLTRSINgHDV, an LTR-based Sindbis genomic clone containing the HDV 

10 ribozvme. was digested with Bgi II to remove the existing LTR promoter and Sindbis 
nucleotides 1-2289 (numbering according to Strauss et al., Virology I33:92- \ 10, 1984), 
treated with calf intestinal alkaline phosphatase, and purified from a 0.7% agarose gel 
using GENE CLEAN II™ (BiolOh San Diego, CA). The corresponding 5'-end 
fragment with a CMV promoter was obtained by Bgl II digestion of the Sindbis 

15 genomic clone pDCMVSINg (Dubensky et al.. ibid) and purification from a 1% agarose 
gel using GENECLEAN II, and then iigated into the Bgl //-deleted pDLTRSINgHDV 
vector to generate the construct pDCMVSINgHDV. This CMV-based genomic plasmid 
with an HDV ribozyme was shown to produce infectious Sindbis vims and cyxopathic 
effect within 24 hr after transfection into BHK cells. Defective helper plasmid 

20 pDCMVdlnsPSINgHDV. containing the HDV ribozyme, was then constructed by BspE 
I digestion and relegation under dilute conditions, to remove nonstructural gene 
sequences between nucleotides 422 and 7054. Subsequently, the SV40 intron was 
synthesized by PCR and inserted into the region of nonstructural protein gene deletion. 
Amplification of the SV40 intron sequence was accomplished by standard three-cycle 

25 PCR with a 30 second extension time, using plasmid pBR322/SV40 (strain 776, ATCC 
#45019) as template and the following oligonucleotide primers that were designed to 
contain flanking BspE I or Bam HI sites. 

Forward primer- RspSVSDF f5'-rest. site/SV40 intron seq.) (SEQ. ID. NO. 50) 
30 5'-TATATATCCGGA/AAGCTCTAAGGTAAATATAAAA 11111 -3' 
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Reverse primer: BamSVSAR f5'-rest. site/S V40 intron secO (SEQ. ID. NO. 51) 
5'TATATAGGATCC/TAGGTTAGGTTGGAATCTAAAATACACAAAC-3' 

5 Following amplification, the DNA fragment was purified using a QIAquick-spin PCR 
purification kit (Qiagen, Chatsworth. CA), digested with BspE I and Bam HI, purified 
from a 1.2% agarose gel using Mermaid™ (Biol OK San Diego, CA), and ligated into 
the defective helper plasmid pDCMV-dlnsPSIN, which was also digested with BspE I 
and Bam HI. treated with calf intestinal alkaline phosphatase, and purified from a 0.7% 

0 agarose gel using GENECLEAN II, to generate the construct pDCMV-imSINrbz 
(Figure 14). Plasmid pDCMV-intSINrbz. which also contains an SV40 promoter- 
driven neomycin resistance selectable marker on another portion of the plasmid, was 
transfected into BHK cells using Lipofectamine™ (Gibco/BRL, Gaithersburg, MD), as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells were 

5 trypsinized and re-plated in media containing 600 ug/ml G418 (neomycin). The media 
was exchanged periodically with fresh G418-containing media and foci of resistant cells 
were allowed to grow. Cells were trypsinized and cloned by limiting dilution in 96 well 
tissue culture dishes, and individual cell clones were grown and expanded for screening. 

Positive packaging activity for the individual clones was identified by 

0 Lipofectin™ (Gibco/BRL. Gaithersburg, MD)-transfection with Sindbis vector RNA 
that expresses a luciferase reporter gene (described in Dubensky et al.. ibid), harvesting 
the culture supematants at approximately 24 hr post-transfection, and assaying for the 
presence of packaged Sindbis-luciferase vector particles. In addition, initial transfection 
levels were determined by harvesting the transfected cell lysates using reporter lysis 

5 buffer (Promega, Madison, WI), and testing for the presence of luciferase activity by 
using luciferin substrate (Promega), as described by the manufacturer. To assay for 
packaged vector panicles in the culture supematants, 1 ml of undiluted, clarified 
supernatant was used to infect fresh BHK cell monolayers for approximately 18 hr. The 
cells were subsequently lysed as above, and luciferase activity was determined. The 

0 presence of luciferase activity in the infected BHK cells was confirmation of packaged 
vector particles in the transfected cell supematants. Several positive cell clones 
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harboring integrated copies of the pDCMV-intSINrbz structural protein expression 
cassette, and functioning as PCL. were identified. Packaging data for two of the 
individual PCL clones (#10s-19 and #10s-22), that were representative of the group, are 
shown in Figure 15. In addition, the titer of packaged vector particles being produced 
5 was determined by transfecting an individual PCL clone (#10s-22) with SIN-P-gal 
vector RNA (described in Dubensky et al., ibid). The culture supernatant was recovered 
at 48 hr post-transfection, clarified by passage through a 0.45 mm filter, and fresh BHK 
monolayers were infected with 10-fold dilutions of the supernatant. Approximately 14 
lir post-infection, the cells were washed with PBS, fixed with 2% formaldehyde, 

10 washed again with PBS, and stained with X-gal. Vector particle units were then 
determined by counting individual blue-stained cells. Packaged (3-gal vector titers from 
this PCL clone were approximately 10 6 infectious units/ml of supernatant. Vector- 
controlled inducibility of Sindbis structural protein expression was demonstrated by 
western blot analysis using a polyclonal rabbit antiserum specific for the structural 

15 proteins. Positive lOs-22 PCL and negative control BHK cell lysates were made in 
Lameli sample buffer, either before (U; uninduced) or after (I; induced) transfection 
with SIN-p-gal vector RNA. As shown in Figure 16, the only cell lysate that showed 
expression of Sindbis structural proteins was the 10s-22 PCL clone after transfection 
with vector RNA. Differences in the apparent levels of expression between the capsid 

20 protein and envelope glycoproteins do not reflect the actual amounts of protein being 
made, rather, the lower stability of the envelope glycoproteins during the cell lysis 
procedure used for this particular experiment. 

Packaging activity of the CMV-based DH construct was also highly 
efficient in non-mammalian cells, for example, C6/36 mosquito cells. The use of such a 

25 non-mammalian parental cell type for derivation of PCL may be particularly 
advantageous when the PCL are intended for subsequent use as starting material for the 
generation of vector producer cell lines. The advantage of this cell type is the natural 
ability of alphaviruses to establish a persistent infection, without the mammalian cell- 
associated phenotype of inhibition of host macromolecular- synthesis and resulting 

30 cytopathic effect (CPE). Thus a DNA-based alphavirus vector (Examples 4 and 5), with 
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an appropriate selectable marker, may be stably transformed into mosquito or other 
non-mammalian cell-derived PCL. Figure 17A shows that both a DNA-based luciferase 
reporter vector and DH helper vector expressing Sindbis structural proteins, under the 
control of the CMV promoter, were fully functional in C6/36 cells, as demonstrated by 
5 luciferase vector packaging. 

B. Construction of PCL wi th operablv-linked selection marker 

In other embodiments of the present invention, a selectable marker is 
operably linked to transcription of the alphavirus structural protein expression cassette. 

10 In preferred embodiments, this operable linkage is accomplished either by insertion of 
the marker into the region of nonstructural protein gene deletion, as a fusion with 
remaining nsPl amino acids, or by insertion downstream of the structural protein genes, 
under the translational control of an internal ribosomal entry site (IRES) sequence. 
Again, amplification of the primary structural protein gene mRNA transcript and 

15 induction of structural protein expression is controlled by the input vector RNA 
molecule and its synthesized nonstructural proteins. 

Specifically, for construction of the structural protein expression 
cassette, plasmid pBGSHl (Spratt et al.. Gene 4/:337-342, 1986; ATCC #37443) was 
modified to remove extraneous sequences, and to render an existing Xlio \ site within 

20 the kanamycin resistance gene non-functional. Plasmid pBGS131 was digested with 
Xho I and a synthetic double-stranded oligonucleotide linker with Xho I-compatible 
ends was ligated into the site. The synthetic 12-mer oligonucleotide, shown below, was 
designed as a partial palindrome that would anneal to itself generating Xho I sticky ends 
for ligation, and maintaining the kanamycin resistance gene open reading frame by 

25 inserting four in-frame amino acids. 

dLYftolinker (SEQ. ID. NO. 52) 
5'-TCGATCCTAGGA 

30 Insertion of this oligonucleotide resulted in a Xlio I site-deleted plasmid, designated 
pBGS131dlA7i£?7. The plasmid was next digested with BspH I and religated to itself 
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under dilute conditions to remove 829 bp of extraneous sequence between the ColEl 
replicon and kanamycin resistance marker, generating the plasmid pBGS131dlB. The 
BspH I site next was changed to a Pac I site by digesting pBGS131dlB with BspH I, 
making the termini blunt with Kienow enzyme and dNTPs. and ligating with excess 
5 Pac I linker. 

Pac I linker (SEQ. ID. NO. 53) 
S'-GCTCTTAATTAAGAGC 

10 This new construct, designated pBGS 1 3 lcilB-P. was further modified by digesting with 
Fsp I and Pvu II to remove an additional 472 bp, including the multiple cloning site 
(MCS) and purifying the remaining vector from a 1% agarose gel using GENECLEAN 
II. A replacement MCS was inserted into the modified vector by annealing two 
complimentary oligonucleotides. PME.MCSI and PME.MCSIL and ligating with the 

15 linear plasmid. 

PME.MCSI (SEQ. ID. NO. 54) 

5'-CTGTTTAAACAGATCTTATCTCGAGTATGCGGCCGCTATGAATTCGTTTAAACGA-3' 

20 PME.MCSII (SEQ. ID. NO. 55) 

5 , -TCGTTTAAACGAATTCATAGCGGCCGCATACTCGAGATAAGATCTGTTTAAACAG-3' 

The new, approximately 2475 bp, cloning vector was designated 
pBGSVG, and contained the following multiple cloning site: <Pme I - Bgl II - XJio I - 

25 Not I - EcoR I - Pme />. Insertion of the structural protein expression cassette 
containing an operably linked selectable marker proceeded stepwise, as follows. A 
DNA fragment comprising the 3'-end of Sindbis virus, a synthetic A40 tract, the 
antigenomic HDV ribozyme, and a BGH transcription termination signal, was removed 
from plasmid pBG/STN-l ELVS 1.5 (Example 5) by digestion with Not I and EcoR L 

30 and purification from a 1% agarose gel using GENECLEAN II. Plasmid pBGSVG also 
was digested with Not I and EcoR /, purified from a 1% agarose gel using 
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generate the construct pBGSV3\ Next, an approximately 9250 bp Sindbis cDNA 
fragment, containing the structural protein genes and much of the nonstructural protein- 
encoding region, was removed from plasmid pDLTRSFNg (Dubensky et at., ibid) by 
5 digestion with Bgl II and Fsp I, and purified from a 0.7% agarose gel using 
GENECLEAN II. The Sindbis cDNA fragment was then ligated into plasmid 
pBGSV3\ which was also digested with Bgl II and Fsp I, treated with alkaline 
phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. The new 
construct was designated pBGSV3'BF. Subsequently, this construct was digested with 

10 Bgl II. treated with alkaline phosphatase, and purified with GENECLEAN II for 
insertion of remaining 5'-end and nonstructural gene sequences, along with a CMV IE 
promoter. The remaining sequences were obtained by digestion of 
plasmidpDCMVSINs (Dubensky et al.. ibid) with Bgl IL purification of the fragment 
from a 1% agarose gel using GENECLEAN II, and ligation with the linear pBGSV3'BF 

15 vector, to create the CMV-driven Sindbis genomic construct. pBGSVCMVgen. 
Functionality of this construct for initiation of the Sindbis virus replication cycle was 
determined by Lipofectamine-mediated transfection of pBGSVCMVgen plasmid into 
BHK cells, and the observance of CPE within 24 hr post-transfection. 

Plasmid pBGSVCMVgen was subsequently used to construct a DH 

20 structural protein expression cassette by deleting most of the nonstructural protein gene 
sequences and inserting a neomycin resistance gene as an in-frame fusion with 
remaining codons of the nsPl open reading frame. Briefly, the neomycin resistance 
gene was amplified by standard three-cycle PCR from the pcDNA3 vector (Invitrogen, 
San Diego, CA), using the following oligonucleotide primers that were designed to 

25 contain flanking BspE I and BamH I sites. 

Forward primer: NEQ5TUSE fS'-rest. site/neo gene^ (SEQ. ID. NO. 56) 
5'-ATATATCCGGA/GTCCGGCCGCTTGGGTGGAGAGGCTA 
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Reverse primer: NEO.VBA M f .V-rest. site/neo gene^ (SEQ. ID. NO. 57) 
5'-ATATAGGATCC/TCAGAAGAACTCGTCAAGAAGGCGA 



Following amplification, the DNA fragment was purified with QIAquick-spin, digested 
5 with BspE I and BamH I, purified using GENECLEAN II. and ligated into piasmid 
pBGSVCMVgen that had also been digested with BspE I and BamH I, treated with 
alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. 
The resulting construct was designated pBGSVCMVdlneo, and is shown schematically 
in Figure 14. The configuration of pBGSVCMVdlneo includes, as part of the structural 
0 protein expression cassette and controlled by the same CMV promoter, a fusion protein 
comprising the initiator methionine and amino-terminal 121 amino acids of nsPl and 
the neomycin resistance gene lacking its methionine initiator codon and next ten amino 
acids. 

Piasmid pBGSVCMVdlneo was transfected into BHK cells using 

5 Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfection, the cells were trypsinized and re-plated in media containing 600 ug/ml of 
the drug G418 (neomycin). The media was exchanged periodically with fresh G41S- 
containing media and foci of resistant cells were allowed to grow. Cells were 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 

0 individual cell clones were grown and expanded for screening. Positive packaging 
activity for the individual clones was identified by transfecting with Sindbis luciferase 
vector RNA and assaying for the presence of packaged Sindbis-luciferase vector 
panicles as described in the previous section. Several positive cell clones harboring 
integrated copies of the pBGSVCMVdlneo structural protein gene expression cassette, 

5 and functioning as PCL, were identified. SNBS™-luciferase vector packaging data for 
individual clones (Fll, F13, F15) that are representative of the group, as well as the 
previously described I0s-22 PCL line, are shown in Figure 18. 

In addition to demonstrating functional packaging activity with Sindbis- 
lucifease vectors, additional experiments performed using the same PCL also showed 

0 that vectors derived from other alphaviruses also could be packaged. For example, both 
Sindbis (Dubensky et al., ibid.) and Semliki Forest (pSFV3-/acZ; GIBCO BRL, 
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Gaithersburg, MD) vector RNAs expressing p-galactosidase, were transfected into the 
F15 packaging cell line. Approximately 48 hr post-transfection, the culture 
supernatants were harvested, clarified, diluted serially, and used to infect fresh BHK 
cell monolayers for determination of vector panicle titers. At 18 hr post-infection, the 
5 BHK cells were fixed, stained with X-gal, and the blue-staining cells were counted, as 
described previously. The vector titers obtained for Sindbis p-gal were approximately 
5xl0 6 rU/ml, while the titers for SFV p-gal were approximately 4xl0 6 IU/ml. These 
data demonstrate that the two different alphaviruses and their corresponding vectors 
have similar packaging signals and that PCL derived for the Sindbis systems described 

10 herein are fully functional when used with another alphavirus. 

Packaging activity of the pBGSVCMVdlneo construct also was highly 
efficient in cells of human origin, for example, 293 cells. The use of such a human 
parental cell type for derivation of PCL may be particularly advantageous in the 
generation of complement resistant recombinant alphavirus panicles. Figure 17B 

15 shows that both RNA and DNA-based luciferase reporter vectors were efficiently 
packaged following transfection into G418 resistant, pBGSVCMVdlneo-transformed 
293 PCL. as demonstrated by supernatant transfer of luciferase expression into BHK 
ceils. 

Another selectable drug-resistance marker aiso was shown to function in 
20 a similar PCL configuration, as a fusion protein with remaining nsPl amino acids at its 
N-terminus. Briefly, the hygromycin phosphotranferase gene (hygromycin resistance 
marker, hygro r ) was substituted into plasmid pBGSVCMVdlneo, in place of the existing 
neomycin resistance marker. The hygro r gene was amplified by standard three-cycle 
PCR from plasmid p3'SS (Stratagene, La Jolla, CA), using the following 
25 oligonucleotide primers that were designed to contain flanking EcoRV and BamHl sites. 

Forward primer- VHYGRQEV (5'-rest. site/h v gro gene> (SEQ. ID. NO. 1 12) 
5'-TATATGATATC/AAAAAGCCTGAACTCACCGCGACG 
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Reverse primer: 3'HYGROBA rS'-rest. site/livgro gene) (SEQ. ID. NO. 1 13) 
S'-ATATAGGATCC/TCAGTTAGCCTCCCCCATCTCCCG 



Following amplification, the DNA fragment was purified with QIAquick-spin, digested 
5 with EcoRV and BamHl, purified using GENECLEAN, and ligated into plasmid 
pBGSVCMVdlneo that had been digested with BspEl, blunt-ended with Klenow, 
digested further with BamHl, treated with alkaline phosphatase, and purified from a 
0.7% gel using Geneclean. The resulting construct was designated pBGSVCMVdlhyg. 

Plasmid pBGSVCMVdlhyg was transfected into BHK cells using 

10 Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfection. the ceils were trypsinized and re-piated in media containing 1.2 mg/ml of 
the drug hygromycin (Boehringer Mannheim). The media was exchanged periodically 
with fresh hygromycin-containing media and foci of resistant cells were allowed to 
grow into a pool. Functionality of the selected packaging cells was demonstrated by 

15 transfecting with Sindbis luciferase vector RNA and assaying for the presence of 
packaged Sindbis-luciferase vector particles as described in the previous section. 
Positive results from these packaging experiments are shown in Figure 39. 

In an alternative packaging cell line structural protein expression 
cassette, the selectable marker (in this case neomycin resistance) was inserted 

20 downstream of the Sindbis structural protein genes and under the translational control of 
an internal ribosome entry site (IRES). Thus, transcription of the mRNA encoding 
neomycin resistance occurs both at the genomic level (from the RSV promoter) and also 
from the subgenomic junction region promoter. Additional features unique to this 
construct include the Rous sarcoma virus (RSV) LTR promoter for primary 

25 transcription and a iRNA Asp 5'-end sequence derived from Sindbis defective-interfering 
RNA clone DI25 (Monroe and Schlesinger, Proc. Natl. Acad. Set USA $0:3279-3283, 
1983). This particular PCL expression cassette configuration was designated 
987DHBBNeo, and is shown schematically in Figure 14. Specifically, plasmid 
987DHBBNeo may be constructed stepwise using the modified plasmid vector 

30 pBGS131dlB-P (described above) as starting material. A cDNA fragment containing 
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the junction region promoter, the structural protein gene sequences, and 3 , -untranslated 
region + polyA is obtained by digestion of the full-length Sindbis cDNA clone pRSINg 
(Dubensky et al.. ibid) with BamH I and Xba /, and purification from a 0.7% agarose gel 
using GENECLEAN II. The Sindbis cDNA DNA fragment is ligated with plasmid 
5 vector pBGS13 IdLB-P that also has been digested with BamH I and Xba /, treated with 
alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. to 
generate the construct pBGSINsp. 

Next, the transcription termination signal from the SV40 early region is 
inserted between the Sac I and Eco RI sites of pBGSINsp, immediately downstream of 
10 the Sindbis sequence. The SV40 viral nucleotides 2643 to 2563, containing the early 
region transcnption termination sequences, are isolated by PCR amplification using the 
primer pair shown below and the pBR322/SV40 plasmid (ATCC # 45019), as template. 

Forward primer- FSVTT2643 (5'-rest. site/SV40 nts 2643-261 3) (SEQ. ID. NO. 58) 
1 5 5'-TATATATGAGCTCTTAC AAATAAAGCAATAGCATCACAAATTTC 

Reverse primer: RSVTT2563 (rest. site/SV40 nts 2563-2588) (SEQ. ID. NO. 59) 
5'-TATATGAATTCGTTTGGACAAACCACAACTAGAATG 

20 The primers are used in a standard three-cycle PCR reaction with a 30 

second extension period. The amplification products are purified with QIAquick-spin, 
digested with Sac I and Eco RI. purified again with the Mermaid kit, and the 90 bp 
fragment is ligated into plasmid pBGSINsp that also has been digested with Sac I and 
EcoRI, treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 

25 GENECLEAN II. This construction is known as pBGSINspSV. 

Next the RSV promoter and Sindbis 5'-end sequences, including the DI 
tRNA Asp structure, are assembled by overlapping PCR and the entire fragment is 
inserted into the structural protein gene vector pBGSINspSV. In PCR reaction #1, the 
RSV promoter fragment is amplified by standard three cycle PCR, with a 1 minute 

30 extension, from an RSV promoter-containing template plasmid (e.g. pRc/RSV, 
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Invitrogen, San Diego, CA), using the following oligonucleotide primers that are 
designed to also contain a flanking Bgl II site in one primer and sequences overlapping 
the tRNA 5'-end in the other. The Sindbis tRNA 5'-end is positioned immediately 
adjacent to the RSV promoter transcription start site. 

5 

Forward primer 5'RSVpro (5'-rest. site/RSV seq/> (SEQ. ID. NO. 60) 
5*-TATATAGATCT/AGTCTTATGCAATACTCTTGTAGT 

Reverse pnmer: TRSVtR fS'-Sin tRNA seq/RSV seq.) (SEQ, ID. NO. 61) 
10 S'-GGGATACTCACCACTATATCTCGACGGTATCGAGGTAGGGCACT 

In PCR reaction #2. the Sindbis 5'-end plus tRNA sequence is amplified by standard 
three cycle PCR with a 1 minute extension, from template plasmid 
TotollOI(5'tRNA Asp ) (Bredenbeek et al., J. Virol 67:6439-6446, 1993), using the 
15 following oligonucleotide primers that are designed to also contain a flanking BamH I 
site in one primer and sequences overlapping the 3'RSVtR primer in the other. 

Forward pnmer 5'tRNASin fS'-Sindbis + tR NA seq. only) (SEQ. ID. NO. 62) 
5'-GATATAGTGGTGAGTATCCCCG 

20 

Reverse primer: VSinBam G'-rest. site/Sindbis seq.) (SEQ. ID. NO. 63) 
5'-TATATGGATCC/AGTACGGTCCGGAGATCCTTAATCTTCTCATG 

Following amplification, the DNA fragments are purified with QIAquick-spin and used 
25 together as templates in a subsequent three-cycle PCR reaction with. 2 minute extension, 
using additional 5'RSVpro and 3 , SinBam primers. The resulting overlapping PCR 
amplicon is purified using GENECLEAN II, digested with Bgl II and BamH I, and 
ligated into plasmid pBGSrNspSV that also has been digested with Bgl II and BamH I, 
treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
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GENECLEAN II. The resulting structural protein expressing, defective helper is 
designated 987DHBB. 

Next, the IRES sequence from encephalomycodarditis virus (EMCV), is 
positioned immediately upstream of the neomycin phosphotransferase gene, as a 
5 selectable marker, by overlapping PCR, and the entire amplicon is inserted into the Nsi 
I site of 987DHBB. Insertion at the Nsi I site will position the selectable marker 
immediately downstream of the structural protein ORE. In PCR reaction #1 . the EMCV 
IRES fragment (nucleotides 260-827) is amplified by standard three cycle PCR, with a 
30 second extension, from template plasmid pBS-ECAT (Jang et aL J. Virol 6J:1651. 
10 1989), using the following oligonucleotide primers that are designed to also contain a 
flanking Nsi I site in one primer and sequences overlapping the neo gene in the other. 

Forward primer: 5'EMCVIRES f5'-rest. site/EMCV seq.) (SEQ. ID. NO. 64) 
5'-TATATATGCAT/CCCCCCCCCCCCCAACG 

15 

Reverse primer: TEMCVIRES f5'-pcDNA + neo seq/EMCV seq.) (SEQ. ID. NO. 65) 
5'-CATGCGAAACGATCCTCATC/CTTACAATCGTGGTTTTCAAAGG 

In PCR reaction #2, the neo resistance marker is amplified by standard three cycle PCR 
20 with a 1.5 minute extension, from template plasmid pcDNA3 (Invitrogen, San Diego, 
CA), using the following oligonucleotide primers that are designed to also contain a 
flanking Nsi / site in one primer and sequences overlapping the 3'EMCVIRES pnmer in 
the other. 

25 Forward primer: 5'Neo/pcPNA (5'-pcDNA + neo seq. onM (SEQ. ID. NO. 66) 
5'-GATGAGGATCGTTTCGCATGATTGA 

Reverse primer: 3'Neo/pcDNA f3'-rest. site/neo seq.) (SEQ. ID. NO. 67) 
5'-TATATATGCAT/TCAGAAGAACTCGTCAAGAAGGCGA 

30 
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Following amplification, the DNA fragments are purified with QIAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 2 minute extension, 
using additional 5'EMCVIRES and 3'Neo/pcDNA primers. The resulting overlapping 
PCR amplicon is purified using GENECLEAN II, digested with Nsi L and ligated into 
5 plasmid 9S7DHBB that also has been digested with Nsi /, treated with alkaline 
phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. The 
resulting structural protein expression construct, with the IRES/neo insert in the Sindbis 
3'-untranslated region, is designated 9S7DHBBNeo. 

To generate stable packaging cell lines, BHK cells were transfected with 

10 10 ug of plasmid 9S7DHBBNeo. using a standard calcium phosphate precipitation 
protocol. Approximately 24 hr post-transfection, the media was replaced with fresh 
media containing 1 mg/ml of the drug G418. After one additional day, the cells were 
trypsinized and re-plated at 1/10 density in media containing 500 ug/ml G418. After 
several more passages, the cells were subjected to dilution cloning and individual clones 

1 5 were expanded. The ability of individual clones to function as packaging cell lines was 
determined by calcium phosphate transfection of plasmid RSV/Sinrep/LacZ, a Sindbis 
DNA vector expressing (3-gal, and assaying for the presence of packaged vector 
panicles in the supernatams after 48 hr. The packaged vector replicons were titered by 
the CPE assay described in Froiov and Schlesinger {J. Virol. 65:1721-1727, 1994) and 

20 one that gave high titers of packaged panicles, designated 987DH-BBNeo, was used for 
further characterization. Packaged vector titers were determined at 48 hr, following 
transfection of either RNA- or DNA-based Sindbis vectors expressing {3-gal, using 
several different transfection techniques. The results were as follows: 



transfection procedure 

electroporation 

electroporation 

Lipofectamine 
Lipofectin 
Calcium Phosphate 



nucleic acid added 
RSVSINrep/LacZ DNA (2.5 ug) 

SINrep/LacZ RNA (2.5 ug) 
RSVSINrep/LacZ DNA (2 ug) 
SINrep/LacZ RNA (2 ug) 

RSVSINrep/LacZ DNA (10 ug) 



titers finfec tious units/ml) 

1.5 x I0 9 /ml 

6x l0 9 /ml 
no packaged particles 

5-6 x 10 7 /ml 

1.5 x l0 9 /ml 
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In addition, SINrep/LacZ panicles that were packaged using 987DH-BB 
cell lines subsequently were used to infect fresh BHK cell monolayers and examine 
both RNA and protein expression patterns. Figure 19 shows the RNA pattern after 
5 BHK cells were infected with two different preparations of SINrep/LacZ panicles at a 
MOI of 150 infectious units per cell (lanes I and 2), or wild-type Sindbis virus (lane 3), 
as a control. Seven hours post-infection, dactinomycin (1 ug/ml) and [ 3 H]uridine (20 
uCi/ml) were added, followed by harvest and analysis of RNA 4 hr later, according to 
Bredenbeek et al. (J. Virol. 67:6439-6446, 1993). The high MOI was used in order to 

10 detect possible recombinants. Horizontal lines to the right of the gel lanes indicate the 
Sindbis and p-gal RNAs of interest. The highest molecular weight band indicates the 
genomic RNA of the replicon or virus (lanes 1 and 2, SINrep/LacZ; lane 3 Sindbis 
virus). The next two RNAs indicated are the genomic RNA of the 987DH-BBNeo PCL 
expression cassette and the inducible subgenomic structural protein mRNA from the 

15 same 987DH-BBNeo PCL cassette. The presence of the latter two bands demonstrates 
that the helper genomic RNA derived from the packaging cell line is also co-packaged. 
The next RNA bands, those present in greatest abundance, are the subgenomic RNAs 
derived from either SINrep/LacZ (lanes 1 and 2) or the Sindbis virus genome (lane 3). 

Protein analysis was performed following infection of BHK 21 cells with 

20 packaged SINrep/LacZ replicons at a MOI of 20 infectious units/cell. Fifteen hours 
post-infection, the cells were labeled with [ 35 S] methionine for 30 minutes, iysates 
made, and the proteins analyzed by SDS-PAGE. As shown in Figure 20 (lanes 2 and 
3), both beta-galactosidase and the Sindbis virus capsid protein are labeled in the vector 
particle-infected cells, but not in uninfected cells (lane 1). The presence of capsid 

25 shows that some of the packaged panicles also contain structural protein gene RNA 
transcripts from the PCL. 

C. Construction of "split structural gen e" PCL configurations 

In other embodiments of the present invention, PCL are provided 
30 wherein the alphavirus structural proteins are expressed, not as a polyprotein from a 
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single mRNA, with its native post-translational processing,, but rather, as separate 
proteins from independent mRNAs that are transcribed via multiple cassettes. This 
approach is depicted schematically in Figure 21 A. Such a configuration greatly 
minimizes the possibility of recombination or co-packaging events that lead to 
5 formation of replication-competent or infectious virus. In preferred embodiments, the 
capsid protein is expressed from one stably transformed cassette and the envelope 
glycoproteins are expressed together from a second stably transformed cassette, and 
each is expressed in a vector-inducible manner from the junction region promoter 
(described above), 

10 . For example, the Sindbis virus capsid protein gene was amplified from 

plasmid pDLTRSINg (Dubensky et al.. ibid), by standard three-cycle PCR with a 1.5 
minute extension, using the following oligonucleotide primers that were designed to 
contain a flanking XJw I site and capsid protein gene initiation codon or a flanking Not I 
site and translation stop codon. 

15 

Forward primer- STN.S'CXho f5'-rest. site/capsid seq.) (SEQ. ID. NO. 68) 
5'-ATATACTCGAG/ACCACCACCATGAATAGAGGATTC 

Reverse primer: SIN3'CNot f5'-rest. site/stop codorv'capsid seq.t (SEQ. ID. NO. 69) 
20 5'-TATATGCGGCCGC/TATTA/CCACTCTTCTGTCCCTTCCGGGGT 

Following amplification, the capsid DNA fragment was purified with QIAquick-spin. 
digested with XJw I and Not I. purified using GENECLEAN II, and ligated into the 
DNA-based Sindbis expression vector pDCMVSIN-luc (Dubensky et al., ibid), that also 

25 had been digested with Xho I and Not I to remove its luciferase reporter gene insert, 
treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. The resulting capsid protein expression construct was designated 
pDCMVSIN-C. Plasmid pDCMVSIN-C was subsequently digested with BspE I to 
remove most nonstructural protein gene sequences, and re-ligated to itself under dilute 

30 conditions to create the DH vector construct, pDCMVSINdl-C. 
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Alternatively, the vector backbone may be first modified to contain the 
RSV promoter and/or 5 '-end tRNA sequences described previously for 987DHBB. 
Specifically, this was accomplished by step-wise replacements using plasmid PBGSV3 1 
(see above) as starting material. The junction region promoter plus Xlxo I and Not I 
5 cloning sites were obtained as a luciferase reporter-containing fragment from 
pDCMVSIN-luc (see above). Plasmid pDCMVSIN-luc was digested with Bam HI and 
Fsp I, and the luciferase reporter-containing fragment was purified from a 0.7% agarose 
gel using GENECLEAN II. The fragment was ligated into plasmid pBGSV3' that also 
had been digested wiih Bam HI and Fsp I, and treated with alkaline phosphatase to 

10 produce a plasmid designated pBGSV3'BaFLuc. The RSV promoter/5' -end tRNA 
sequence was then obtained from 987DHBB by digestion with Bgl II and Bam HI and 
purification from a 1% agarose gel using GENECLEAN II. This fragment was ligated 
into pBGSV3 , BaFLuc that was similarly digested with Bgl II and Bam HI, to produce 
the construct pBRSV987dl-Luc, which may be used as starting material for either 

1 5 capsid or envelope glycoprotein expression constructs. 

To generate a capsid gene expression construct with the RSV promoter 
and tRNA 5'-end sequence, the existing luciferase reporter gene insert was removed by 
digestion w'rth. XJw I and Not I, and replaced with a PCR-amplified capsid protein gene 
(see above), that also was digested with Xlxo I and Not I. The resulting construct was 

20 designated pBRSV987dl-C. Insertion of a neomycin phosphotransferase selectable 
marker into the region of nonstructural protein gene deletion was accomplished by 
digestion with BspE I and Bam HI, and replacement with a PCR-amplified neo r gene 
(see above) that also was digested with BspE I and Bam HI, and purified from a 1% 
agarose gel. The resulting construct was designated pBRSV987dlneo-C and is shown 

25 schematically in Figure 2 IB. 

Plasmids pDCMVSINdl-C and pBRSV987dlneo-C, which contain 
neomycin resistance selectable markers, were transfected into BHK-21 cells using 
Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfection, the cells were trypsinized and re-plated in media containing 600 ug/ml of 

30 the drug G418 (neomycin). The media was exchanged periodically with fresh G418- 
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containing media and foci of resistant cells were allowed to grow. Cells were 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones were grown and expanded for screening. Cells which inducibly 
expressed capsid protein in response to input vector were identified by transfecting with 
5 Sindbis luciferase vector RNA or Sindbis P-galactosidase DNA vectors, making cell 
lysates approximately 24 or 48 hr post-transfection, and performing western blot 
analysis with a rabbit anti-Sindbis polyclonal antibody. Several positive cell clones 
harboring integrated copies of the capsid protein gene expression cassette and inducibly 
expressing the protein were identified and are shown in Figure 2 ID. 

10 In order to demonstrate both inducibility and functionality of the 

expressed capsid in the context of "split structural gene" cassettes, an additional 
construct that expressed the Sindbis virus envelope glycoproteins was generated from 
pDCMVSfN-luc. Briefly, the Sindbis envelope glycoprotein genes were amplified from 
plasmid pDLTRSINg by standard three-cycle PCR, with a 2.5 minute extension, and 

15 using the following oligonucleotide primers that are designed to contain a flanking XJw 
I site and translation initiation codon in good Kozak context, or a flanking Not I site and 
the translation stop codon. 

Forward pnmer: 5'GLYCO-X f 5'-rest. site/initiation codon/'elycoprotein seq.) 

20 (SEQ. ID. NO. 70) 

S'-ATATACTCGAG/AGCAATG/TCCGCAGCACCACTGGTCACGGCA 

Reverse primer: 3'GLYCO-N f5'-rest. site/glycoprotein seq.) (SEQ. ID. NO. 71) 
S'-ATATAGGCGGCCCC/TCATCTTCGTGTGCTAGTCAGCATC 

25 

Following amplification, the glycoprotein gene DNA fragment was purified with 
QIAquick-spin, digested with Xlw I and Not I, purified using GENECLEAN II, and 
ligated into the DNA-based Sindbis expression vector pDCMVSIN-luc, that also had 
been digested with XJw I and Not I to remove its luciferase reporter gene insert, treated 
30 with alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN 
II. The resulting glycoprotein expression construct was designated pDCMVSINl.SPE. 
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Figure 21C is a western blot demonstrating vector controlled inducibiiity 
of two different clonal capsid lines (9-3 and 9-9) cell lines, that were transfected with 
Sindbis DNA vectors expressing the envelope glycoproteins (1.5PE lanes) or a p- 
galactosidase reporter, pDCMVSIN-p-gal (Dubensky et al., ibid ; 1.5p-gal lanes), or 
5 "mock" transfected (M lanes), using Lipofectamine. Cell lysates were made at 48 hr 
post-transfection, separated by SDS-PAGE. and transferred to membranes, where they 
were probed with a combination of antibodies specific for Sindbis structural proteins 
and p-galactosidase. The blot clearly shows the inducibiiity of capsid protein in 
response to the nonstructural proteins supplied by either vector, as well as the 

10 expression of p-galactosidase and the envelope glycoproteins. Functionality of the 
"split structural gene" capsid cell lines, by complementation and vector panicle 
packaging, was demonstrated by co-transfecting the (3-galactosidase and envelope 
glycoprotein vectors into a capsid cell line using Lipofectamine, and assaying for 
packaged panicles in the culture supernatants. Approximately 48 hr post-transfection, 

15 the supernatants were harvested and clarified for the packaging assays and vector titer 
determination. In addition, the cells were lysed using Lameli sample buffer and 
examined by western blot analysis with polyclonal anti-Sindbis antibody, demonstrating 
expression of both capsid protein and the vector supplied envelope glycoproteins. The 
supernatants were then tested for the presence of packaged vector panicles by infecting 

20 naive BHK cells for approximately 18 hr, and staining for p-gal reporter gene 
expression, as described previously in this example. Functionality of the cell lines for 
complementation and packaging was demonstrated by the observance of blue-stained 
p-gal expressing cells. 

To generate stable "split structural gene" PCL that have separate vector 

25 inducible expression cassettes for both capsid protein and the envelope glycoproteins, 
any of the above described capsid cell lines may be used, in conjunction with an 
additional envelope glycoprotein expression construct that contains a different 
selectable marker (for example, hygromycin B resistance). In one example, 
pBRSV987dI-Luc was used as starting material to generate a glycoprotein gene 

30 expression construct with the RSV promoter and tRNA 5'-end sequence. The existing 
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luciferase reporter gene insert of pBRSV987dl-Luc was removed by digestion with Xlw 
I and Not I, and replaced with a PCR-amplified glycoprotein gene (pE2/El) product 
(see above), that also was digested with Xlw I and Not I. and purified from a 0.7% 
agarose gel. The resulting construct was designated pBRSV987dl-Glyco. Insertion of a 
5 hygromycin phosphotransferase selectable marker into the region of nonstructural 
protein gene deletion was accomplished by digestion of plasmid pBRSV9S7dl-Glyco 
with BspE I. blunt-ending with Klenow, and further digesting with Bam HI. The 
hygromycin' insert was obtained as a PCR-amplified product (see above) that was 
digested with EcoR V and BamH I, and ligated into the prepared pBRSV987dl-Glyco 

10 vector. This construct was modified further to include an RNA export element. The 
PRE sequence was inserted by first isolating a PCR-generated 564 bp fragment of HBV 
from the full-length genomic clone of the ADW viral strain, pAM6 (ATCC No. 39630), 
as described in Example 5. Following amplification and purification, the purified HBV 
PRE fragment was cloned into the pCR-Blunt (INVITROGEN, San Diego, CA) 

15 plasmid vector, to yield the construct pHBV-PRE. The HBV PRE element then was 
isolated from pHBV-PRE by digestion with Not I and 2% agarose gel electrophoresis, 
and ligated into the hyromycin reistance marker-containing construct derived from 
pBRSV987dl-Glyco, that was also digested with Not I and treated with CIAP, to yield 
the final construct, pBRSV987dlhyg-Glyco. 

20 In certain embodiments, it may be desirable to also include a translation 

enhancement element that may derived from capsid gene sequences of homologous or 
heterologous alphaviruses. For inclusion of a Ross River virus translation enhancer, an 
appropriate sequence may be obtained from the DH-BB CA3rrv construct described in 
example 8. Specifically, DH-BB CA3rrv was digested with Bam HI and Bsi WI, and 

25 a fragment containing the junction region promoter, Ross River virus translation 
enhancer, and the amino terminal sequences of the pE2 gene, was isolated using a 1.2% 
agarose gel and GENECLEAN II. This fragment was ligated into plasmid 
pBRSV987dlhyg-Glyco that was similarly digested with Bam HI and Bsi WI, to 
produce the expression cassette designated pBRSV987dlhyg-rrv-Glyco, and shown 

30 schematically in Figure 2 IB. 
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Alternatively, plasmid pBG/SIN-1 ELVS 1.5-(3-gal (Example 5) may be 
used as starting material, by digestion with BamH I and Fsp i to isolate the sequences 
comprising the junction region promoter, the [3-gal reporter gene, and some 3'-end 
sequences. The desired fragment is purified from a 1% agarose gel using 
5 GENECLEAN II, and ligated into plasmid pBGSVCMVdlneo (see above) that also has 
been digested with Bam HI and Fsp I to eliminate all structural protein gene sequences, 
treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. The resulting construct is designated as pBGSVCMVdlsP-luc. 
Plasmid pBGSVCMVdlsP-luc is next digested with X)w I and Not I to remove the 

10 luciferase reporter gene, treated with alkaline phosphatase, and purified from a 0.7% 
agarose gel using GENECLEAN II. and the Xlio I- and Not I-digested envelope 
glycoprotein PCR amplicon from above is subsequently ligated into the digested 
pBGSVCMVdlsP-luc vector to generate the envelope glycoprotein expressing DH 
construct. pBGSVCMVdl-G. Insertion of a hygromycin resistance marker cassette into 

15 this plasmid. as well as flanking HSV TK promoter and polyadenylation sequences, is 
accomplished by PCR amplification, using a standard three-cycle protocol with 2.5 
minute extension, plasmid pDR2 (Clontech, Palo Alto, CA) as template, and the 
following oligonucleotide primers that are designed to contain flanking Pac I sites. 

20 Forward primer: 5'HYGRO/Pro-P (5'-rest. site/pDR2 seq.) (SEQ. ID. NO. 72) 
5'-ACACATTAATTAA/CGATGCCGCCGGAAGCGAGAA 

Reverse primer THYGRO/pA-P (5'-rest. site/pDR2 seq.l (SEQ. ID. NO. 73) 
S'-ACACATTAATTAA/GTATTGGCCCCAATGGGGTCT 

25 

Following amplification, the DNA fragment is purified with QlAquick-spin, digested 
with Pac I, purified using GENECLEAN II, and ligated into plasmid pBGSVCMVdl-G 
that also has been digested with Pac I, treated with alkaline phosphatase, and purified 
using GENECLEAN II. The resulting construct is designated as pBGSVhygro-G. 
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Plasmid pBRSV987dlhyg-rrv-Glyco, which contains a hygromycin 
selectable marker, was transfected into a clonal capsid cell line using Lipofectamine, as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells were 
trypsinized and re-plated in media containing 500 ug/ml of hygromycin (Calbiochem ; 
5 La Jolla, CA). The media was exchanged periodically with fresh hygromycin- 
containing media and foci of resistant cells were allowed to grow. Cells were 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones were grown and expanded for screening. Split structural gene 
PCL derived in this manner were designated C/GLYCO PCL. Positive cells which 

10 inducibly express biologically active capsid protein and envelope glycoproteins in 
response to input vector were identified in two ways. Initially, transfer of expression 
experiments were performed to demonstrate that transfected vector molecules could 
induce structural protein expression, resulting in packaging and secretion of vector 
panicles that could in turn be used to infect naive cells. Sindbis virus plasmid DNA 

15 vectors expressing [3-galactosidase were transfected into panels of potential C/GLYCO 
PCL clones derived from two independently selected pools (Figure 26B, pools C and 
E). At 48 hr post-transfection. supernatants were harvested and used to infect naive 
BHK-21 ceils for an additional 18 hr. Infected cell lysates were harvested and 
enzymatic p-galactosidase activity determined. As shown in the figure, several clones 

20 were able to package vector, resulting in the high level transfer of vector to naive cells. 
In a second experiment, transfected PCL were lysed and subjected to western blot 
analysis as described previously. As shown in Figure 26C, induction of both capsid and 
envelope glycoprotein occurs after introduction of vector into the PCL. 

25 D. Construction of PCL with "hybrid" st ructural proteins 

An additional approach which may be utilized to decrease the level of 
co-packaging or recombination between DH and vector RNA molecules, to enhance 
translation of the glycoprotein genes, or to alter the cell or tissue specificity of the 
packaged recombinant alphavirus vector particles, makes use of structural protein genes 

30 derived from other alphaviruses or togaviruses. More specifically, numerous 
combinations of alphavirus or togavirus structural protein genes for use with Sindbis 
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vims or different alphavirus vectors can be envisioned. For example, the capsid protein 
gene of Ross River virus (RRV), may be used in conjunction with the envelope 
glycoprotein genes of Sindbis virus (expressed from the same or a different construct), 
to package a Sindbis virus-derived vector described in examples 3. 4, or 5. In addition, 
5 a deleted form of the RRV capsid protein gene may be positioned immediately 
upstream of the Sindbis glycoprotein gene sequences to serve as a translational 
enhancer elements. As another example, the structural proteins of Sindbis virus may be 
used to package Semliki Forest virus RNA vectors. 

Specifically, defective helper (DH) structural protein constructs that 
contain an intact or deleted form of 
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3. RRV Nci2: (m.7616-76301 (SEQ. ID. NO. 76) 
5'-ccacggatCCCGGCGTTCCGTCC 

5 4. RRV Apal : fnt.8088-8102) (SEQ. ID. NO. 77) 
5'-ccacaagcttGTGCACTGGGATCTG 

5. RRV Apal: (nt.8097-81 1 H (SEQ. ID. NO. 78) 

5'-ccacggatccGTGCACATGAAGTCC 

10 

6. RRVRsp: (nt.8339-836H (SEQ. ID. NO. 79) 

5'-ccacaagCTTCcGGaGTTACCCGAGTGACC 

7. RRV Afll : (nt.7820-7836) (SEQ. ID. NO. 80) 
1 5 5'-ccaccttaaGCGTCGGCTTTTTCTTC 

8. RRV Afl2: (nt.7892-7907) (SEQ, ID. NO. 81) 

5'-ccaccttaaGAGAAGAGAAAGAATG 

20 9. STNAva: fnt.7591-7594) (SEQ. ID. NO. 82) 
S'-ccacaagcttGGACCACCGTAGAG 

1 0. STNRnm: <nt.7325-73431 (SEQ. ID. NO. 83) 

5'-CCGCGTGGCGGATCCCCTG 

25 

1 1 . STNRsp: fnt.841 8-8433) (SEQ. ID. NO. 84) 

5'-ccacggatCCGGAAGGGACAGAAG 

1 2. STNRsn: fnt.8887-8902) (SEQ. ID. NO. 85) 
30 5'-CACGGTCCTGAGGTGC 
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PCR reactions were performed using the primer pairs indicated below, in a standard 
three cycle protocol, with 30 sec extensions and Vent polymerase, to produce the 
corresponding DNA fragments, which are also indicated below. 

5 

PCR frngments:primer pairs, pla smid template 
Fragment 1: prl+pr2, RRV6415 plasmid 
Fragment 2: pr3+pr4, RRV6415 plasmid 
Fragment 3: pr5+pr6, RRV6415 plasmid 
1 0 Fragment 4: pr3+pr7, RRV64 1 5 plasmid 
Fragment 5: pr4+prS, RRV6415 plasmid 
Fragment 6: pr9+prl0, Totol 101 piasmid 
Fragment 7: prl l+prI2, Totol 101 plasmid 

15 Following amplification, the PCR products were digested with the indicated enzymes, 
and ligated into the pUC18 plasmid analog, pRS2, which contains additional polylinker 
sites and which had also been digested with the same enzyme combinations: fragment 1 
was cut with EcoR I+Hind Ilk fragment 2 with BamH l+Hind Ilk fragment 3 with 
BamH I+Hind Ilk fragment 4 with BamH l+A/J Ik fragment 5 with Hind 111+ Afl Ik 

20 fraement 6 with BamH l+Hind Ilk and fragment 7 with BamH \+Bsu 36k All 
insertions were sequenced to verify that artifacts had not been acquired during PCR. 

Subsequently, the fragments were released from the pRS2 plasmids 
using the enzymes indicated below, and ligated exactly as indicated to generate the next 
set of constructs. To generate FR8, fragment 6 (cut by Bam HI and Ava II) was ligated 

25 with fragment 1 (cut by Ava II and Nci 7), fragment 2 (cut by Nci I and Hind III) and 
plasmid pRS2 (cut by Bam HI and Hind III). To generate FR9, fragment 6 (cut by 
BamH I and Ava II) was ligated with fragment 1 (cut by Ava II and Nci I), fragment 4 
(cut by Nci I and Afl II) and plasmid pRS2 (cut by BamH I and Afl II). To generate 
FRIO, fragment 5 (cut by Afl II and ApaL I) was ligated with fragment 3 (cut by ApaL I 

30 and Hind III) and plasmid pRS2 (cut by Afl II and Hind III). After transformation of 
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£. coli, plasmids were analyzed by restriction analysis and their inserts were again 
isolated by digestion and used for the next steps of cloning. 

The FR8 insert (cut by BamH I and ApaL I) was ligated with fragment 3 
(cut by ApaL I and BspM II), fragment 7 (cut by BspM II and BsuS6 I) and plasmid DH- 
5 BB {cut by BamH I and Bsu36 I). The same fragments also were used to replace the 
BamH \-Bsu36 I fragment of plasmid DH-BB(5'SIN). The resulting plasmids were 
designated DH-BB Crrv and DH-BB(5'SIN) Crrv, respectively (see Figure 23). The 
FR9 insert (cut by BamH I and AfllT) was ligated with the FRIO insert (cut by Afl //and 
BspM II), fragment 7 (cut by BspM II and Bsu36 I) and plasmid DH-BB (cut by BamH I 

10 and Bsu36 I). The same fragments also were used to replace the BamH \-Bsu36 I 
fragment of plasmid DH-BB(S'SIN). The resulting plasmids were designated DH-BB 
CArrv and DH-BB(5'SIN) CArrv (see Figure 23). 

Multiple uses for DH constructs that contain chimeric structural protein 
genes are possible, and two such approaches are illustrated in Figures 24 and 25. In 

15 Figure 24, the intact Ross River capsid protein gene is linked with the Sindbis 
glycoprotein gene sequences (DH-BB Crrv or DH-BB(5'SIN) Crrv), as pan of a 
defective helper construct, and co-transfected with a Sindbis reporter RNA vector 
replicon to demonstrate packaging into recombinant alphavirus particles (Figure 26). In 
Figure 25, the deleted form of the Ross River capsid protein gene is linked with the 

20 Sindbis glycoprotein gene sequences (DH-BB CArrv and DH-BB(5'SIN) CArrv), as a 
translational enhancer and pan of the DH construct, while the Sindbis capsid protein 
gene expressed from a second DH construct. Both DH constructs are co-transfected 
with a Sindbis reporter RNA vector replicon to demonstrate packaging into recombinant 
alphavirus particles (Figure 26). Additionally, the Ross River capsid protein gene may 

25 be expressed aione from one DH construct, while the Sindbis glycoproteins are 
expressed from another, for use in packaging. Using this knowledge and the 
availability of several other alphaviruses from which to derive structural protein gene 
sequences, a large number of different protein combinations may be generated in similar 
approaches. 
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Alternatively, the entire complement of structural protein genes from one 
alphavirus, or other members of the Togaviridae (e.g., rubella virus) may be used to 
package an RNA vector derived from another, as shown above for SFV vectors and 
Sindbis structural proteins. In an alternative embodiment, the structural protein genes 
5 from Venezuelan equine encephalitis (VEE) virus may be used to package a Sindbis- 
virus derived vector ('wild-type or displaying the phenotype described in Example 4 or 
5). Such a method provides recombinant alphavirus particles containing vector RNAs 
which exhibit the desirable properties of the present invention, such as delayed, reduced 
or no inhibition of host macromofecular synthesis, plus, structural proteins which 

10 redirect the tropism of the recombinant particle. Venezuelan equine encephalitis virus 
(VEE) is an alphavirus which exhibits tropism for cells of lymphoid origin, unlike its 
Sindbis virus counterpart. Therefore, Sindbis-derived vector constructs packaged by a 
cell line expressing the VEE structural proteins will display the same lymphotropic 
properties as the parental VEE virus from which the packaging cell structural protein 

1 5 gene cassette was obtained. 

Specifically, the Trinidad donkey strain of VEE virus (ATCC £VR-69) is 
propagated in BHK-21 cells, and virion RNA is extracted using procedures similar to 
those described in Example 1. The entire structural protein coding region is amplified 
by PCR with a primer pair whose 5'-ends map, respectively, to the authentic AUG 

20 translational start site, including the surrounding Kozak consensus sequence, and the 
UGA translational stop site. The forward primer is complementary to VEE nucleotides 
7553-7579, and the reverse primer is complementary to VEE nucleotides 1 1206-1 1186 
(sequence from Kinney et al., Virology 77(9:19-30, 1989). PCR amplification of VEE 
cDNA corresponding to the structural protein genes is accomplished using a two-step 

25 reverse transcriptase-PCR protocol as described above, the VEE genome RNA as 
template, and the following oligonucleotide pair, which contain flanking Xlw I and Not 
I sites: 

Forward primer: VEE 7553F f 5'-rest. site/VE E capsid seq.) (SEQ. ID. NO. 86) 
30 5'-TATATATATCTCGAGACCGCCAAGATGTTCCCGTTCCAGCCA-3' 
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Reverse primer: VEE 1I206R fS'-rest. site/VEE El eivco seq.) (SEQ. ID. NO. 87) 
5'-TATATATATGCGGCCGCTCAATTATGTTTCTGGTTGGT-3' 

5 Following PCR amplification, the approximately 3800 bp fragment is purified from a 
0.7% agarose gel using GENECLEAN II. and digested with XJio I and Not I. The 
resulting fragment is then ligated into the DNA-based Sindbis expression vector 
pDCMVSIN-iuc (see above), that also has been digested with Xho 1 and Not 1 to remove 
its luciferase reporter gene insert, treated with alkaline phosphatase, and purified from a 

10 0.7% agarose gel using GENECLEAN II. The resulting VEE structural protein 
expression construct is designated pDCMV-VEEsp. Plasmid pDCMV-VEEsp 
subsequently is digested, under limiting partial digest conditions, with BspE I to remove 
most nonstructural protein gene sequences, and re-ligated to create the structural 
protein-expressing DH vector construct, pDCMV-VEEdl. 

15 Plasmid pDCMV-VEEdl, which also contains a neomycin resistance 

marker, is transfected into BHK cells using Lipofectamine, as described by the 
manufacturer. Approximately 24 hr post-transfection, the cells are trypsinized and re- 
plated in media containing 600 fig/ml of the drug G418. The media is exchanged 
periodically with fresh G41 8-containing media and foci of resistant cells are allowed to 

20 grow. Cells are trypsinized and cloned by limiting dilution in 96 well tissue culture 
dishes, and individual cell clones are grown and expanded for screening. Cells which 
inducibly express VEE structural proteins in response to input vector are identified by 
transfecting with Sindbis luciferase vector RNA, and assaying for VEE structural 
protein expression in cell lysates or packaged luciferase vector in the supematants, as 

25 described previously. Structural protein genes obtained from variants of VEE, or other 
alphaviruses and their variants differing in tissue tropism, also are useful when 
following this approach. In addition, each of the various structural protein gene 
expression cassette configurations described in this example, including split structural 
gene PCL, may be used. 

30 
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E. P roduction or' package d alphavirus vectors from PCL 

Alphavirus derived PCL described throughout this example may be used 
in a number of different ways to produce recombinant alphavirus particle stocks, and 
include the introduction of vector by either transfection of RNA or DNA molecules, 
5 infection with previously produced packaged vector-containing particles, or the 
intracellular production of vector from stably transformed expression cassettes. The 
utility of alphavirus PCL for the production of vector particles is demonstrated first with 
a reporter vector construct, and later may be applied to any other vector constructs 
which express a desired heterologous sequence. For example, a stock of packaged 

10 Sindbis-P-gal vector particles is obtained by electroporation of approximately 10' 
alphavirus C/GLYCO or other PCL cells (see above, and for example, figures 15, 17, 
18) with 5-10 ug pKSSIN-l-BV-(3-gal or -luciferase RNA (Example 4) or pBG/SIN-l 
ELVS I.5-P-gal or -luciferase DNA (Example 5), using the procedure described in 
(Liljestrom and Garoff, Bio/Technology 9:1356-1361, 1991). The transfected PCL are 

15 incubated at a desired temperature {e.g., 37°C), and at approximately 48 hr. post- 
transfection, the supernatants are harvested and clarified by passage through a 0.45 
micron filter. Additional formulation may be performed using parameters illustrated in 
the detailed description of this invention. 

Alternatively, a stock of packaged recombinant alphavirus particles 

20 (obtained using PCL as above, or by co-transfection of vector and DH constructs) is 
used to infect a naive culture of PCL, for further amplification. For example, 5xl0 7 of 
alphavirus C/GLYCO or other PCL cells are infected with a stock of packaged 
pKSSFN-l-BV-p-gal vector at an approximate multiplicity of infection (MOI) of 1 
infectious unit/cell. Upon reaching the cell cytoplasm, the particle delivered RNA 

25 vector is autocatalytically amplified and packaged into additional progeny particles. 
After incubation at the desired temperature (e.g., 37°C) for approximately 48 hr., the 
culture supernatants are harvested, clarified by passage through a 0.45 micron filter, and 
formulated as desired. 

In still another method which exploits the ability of PCL to further 

30 amplify packaged recombinant alphavirus vector particles, stocks of packaged panicles 
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are used to infect naive cultures of PCL to create a working cell bank of vector- 
containing PCL (vector producing cells. Figure 27), which may subsequently be used to 
seed another naive culture of PCL. For example, such a working cell bank is obtained 
by infection of alphavirus C/GLYCO or other PCL cells with the packaged vector stock 
5 at a M.O.I.- 5. Approximately 2-3 hr post-infection, the vector containing PCL are 
gently detached and cell number is determined. The vector containing PCL may now 
be used directly, or aliquoted and stored in liquid N 2 as a vector producing cell bank. 
When desired, the cells are seeded directly into a previously growing culture of naive 
alphavirus C/GLYCO or other PCL at a ratio of approximately 1 vector producing cell 

10 per 1000 fresh PCL. for production of large quantities of high titer packaged vector 
panicles. Aliquots of culture supernatant are harvested at various times post-coculture 
to determine the time of maximal recombinant alphavirus panicle production, and that 
time is chosen for further harvest, purification and formulation, as described above. 
The same sequential amplification methodology using vector producing cells also is 

15 useful for targe-scale production of any desired recombinant protein (Figure 27). For 
the production of recombinant protein, supernatants or cell lysates may be harvested, 
depending on the nature of the recombinant protein. 

In yet another method for producing high titer or large scale stocks of 
packaged recombinant alphavirus panicles, the desired expression vector is introduced 

20 into 1-5% of a naive alphavirus PCL culture by transfection of in vitro transcribed RNA 
or plasmid DNA vector using a commonly accepted reagent or method (for example, 
Lipofectin or Lipofectamine. respectively, or infection with vector particles at low MOI 
[< 0.1]), as described herein. The recombinant vector particles produced by the initial 
ceils, into which vector was introduced, subsequently infect other naive packaging cells 

25 in the culture, which in turn, produce even more packaged particles. This process of 
temporal amplification continues until packaged recombinant alphavirus particles are 
produced in all cells of the PCL culture. 

The amplification process is demonstrated in Figures 26D and 28. 
ELVS 1.5-P-gal plasmid DNA was transfected into 987DHBBNeo packaging ceils or 

30 into BHK-21 cells, and the levels of p-galactosidase present in cell lysates was 
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measured, as described previously, at the indicated times post-transfection. In BHK-21 
cells, the level of p-galactosidase expression reached a maximum by approximately 48 
hpt, and plateaued. In contrast, the level of [3-galactosidase expression continued to 
increase over a longer period of time in the ELVS 1.5-p-gal transfected 987DHBBNeo 
5 PCL culture, reflecting the recombinant vector particle amplification process, and the 
ultimate expression of p-galactosidase in all of the cells of the culture. Further, 
infection of split structural gene PCL with Sindbis vector particles (Figure 26D) also 
resulted in panicle amplification. In all cases, stocks of recombinant alphavirus vector 
panicles may be formulated so as to be pharmaceutical^ acceptable, using any of the 
10 methods described herein. 

EXAMPLE 7 

Construction of Alphavirus Producer Cell Lines 

15 The generation of alphavirus PCL, as described above, coupled with the 

construciion of DNA-based alphavirus vectors exhibiting reduced, delayed, or no 
inhibition of host cell macromoiecular synthesis (Examples 1, 2, 4 and 5), provides a 
relatively straightforward mechanism to derive alphavirus vector producer cell lines. In 
certain embodiments of the present invention, the vector producer cell lines contain one 

20 or more stably transformed structural protein gene expression cassettes, and also 
alphavirus RNA expression vector molecules with the above phenotype, that are 
transfected, transduced, or intracellularly produced, leading to the production of 
packaged vector particles. In preferred embodiments, an RNA vector replicon is 
produced intracellularly from a stably transformed DNA molecule (eukaryotic layered 

25 vector initiation system) that exists in either an integrated form or as an episomal DNA, 
with transcription of vector RNAs being controlled inducibly by one or more stimuli 
provided at a desired time. This type of alphavirus producer cell line configuration 
essentially provides a cascade of events that include: inducible production of vector 
RNA and resulting autocatalytic cytoplasmic amplification of the RNA, the induction of 

30 high level structural protein expression by vector-supplied nonstructural proteins, the 
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packaging of vector RNA by the expressed structural proteins, and the release of 
packaged vector particles. Tightly regulated, inducible expression of vector RNA from 
the DNA molecule, once producer cell population reaches as desired number, is 
preferred, due to the potential for low level cytotoxicity of vector replication, or the 
5 necessity to control nonstructural protein synthesis, as it relates to the regulation of 
positive strand versus negative strand vector RNA ratios. 

A. Alphavirus DNA Vectors with Single Level Regulation 

In certain embodiments of the present invention, a DNA-based 

10 alphavirus vector is provided, wherein in vivo transcription of an alphavirus vector 
RNA molecule that is capable of autocatalytic amplification occurs from a promoter 
which is regulatable by applying a stimulus at a desired time. Such a DNA-based 
alphavirus vector subsequently may be stably transformed into an alphavirus packaging 
cell line (PCL) to create an inducible alphavirus producer cell line. The producer cell 

15 line configuration described herein, is therefore, a "feed- forward" system in which: 1) a 
stimulus is applied to the cell, resulting in efficient transcription of alphavirus vector 
RNA; 2) the vector RNA replicates autocatalytically and produces nonstructural 
proteins; 3) the nonstructural proteins stimulate amplification of the structural protein 
expression cassette mRNAs and high level structural protein expression; and 4) the 

20 structural proteins interact with the vector RNA and result in the subsequent packaging 
of recombinant alphavirus particles which are released into the culture media. Any 
previously described alphavirus PCL, which is stably transformed with one or more 
inducible alphavirus structural protein expression cassettes, may serve as the parental 
line with which to derive the producer cell line. 

25 For example, a tetracycline-responsive promoter system (Gossen and 

Bujard, Proc. Natl. Acad. ScL 59:5547-5551, 1992) may be utilized for inducible 
transcription of an alphavirus vector RNA, as depicted in Figure 29. In this system, the 
expression of a tetracycline repressor and HSV-VP16 transactivator domain, as a 
"fusion" protein (rTA), stimulates in vivo transcription of the alphavirus vector RNA by 

30 binding specifically to a tetracycline operator sequence (tetO) located immediately 
adjacent to a minimal "core" promoter (for example, CMV). The binding and 
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transactivation event is reversibly blocked by the presence of tetracycline, and may be 
"turned on" by removing tetracycline from the culture media. As uninduced basal 
levels of transcription will vary among different cell types, other different minimal core 
promoters (for example HSV-tk) may be linked to the tetracycline operator sequences, 
5 provided the transcription start site is known, to allow juxtaposition at or in the 
immediate proximity of alphavinis vector nucleotide 1 . 

The rTA transactivator is provided by an additional expression cassette 
also stably transformed into the same cell line; and in certain embodiments, the rTA 
expression cassette may itself be autoregulatory. The use of an autoregulatory rTA 

10 expression cassette circumvents potential toxicity problems associated with constitutive 
high ievel expression of rTA by linking expression to transcriptional control by the 
same tetO-linked promoter to which rTA itself binds. This type of system creates a 
negative feedback cycle that ensures very little rTA is produced in the presence of 
tetracycline, but becomes highly active when the tetracycline is removed (Figure 29). 

15 Such an autoregulatory rTA expression cassette is provided in plasmid pTet-tTAk 
(Shockett et ai.. Proc. Nad. Acad. Sci. USA 92:6522-6526, 1995). 

Functionality of such a tetracycline-regulated DNA-based alphavinis 
vector is demonstrated by constructing a modified SIN- 1 -derived luciferase plasmid 
vector, which is driven by a tetracycline operator/CMV minimal promoter. Using 

20 plasmids pBG/SIN-1 ELVS1.5-luc (Example 4) and pBGSV3' (Example 6) as starting 
material, an approximately 7200 bp fragment, including much of the SIN-1 
nonstructural -encoding region, the junction region promoter and luciferase reporter 
gene, and a portion of the 3'-UTR, is isolated by digestion of pBG/SIN-1 ELVS1.5-luc 
with Bgl //and Fsp I, and purification from a 0.7% agarose gel using GENECLEAN II. 

25 The 7200 bp fragment is subsequently ligated into plasmid pBGSV3' that has also been 
digested with Bgl II and Fsp I, treated with alkaline phosphatase, and purified from a 
0.7% agarose gel using GENECLEAN II. The resulting construct is designated 
pBGSVdlB/SINl-luc. Insertion of the remaining sequences, which include the 
heptamerized tetracycline operator and minimal CMV promoter (tetO/CMV) linked to 

30 Sindbis nucleotides 1-2289, such that transcription will initiate with one additional 
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nonviral nucleotide 5' of Sindbis nucleotide 1, is accomplished by overlapping PCR. In 
PCR reaction #i. the approximately 370 bp tetO/CMV portion of the sequence is 
amplified by standard three-cycie PCR with a 30 second extension from template 
plasmid pUHC13-3 (Gossen and Bujard. ibid) using the following oligonucleotide 
5 primers that are designed to also contain flanking Bgl II and Asc I sites on one primer 
and sequences overlapping 5'-Sindbis nucleotides on the other. 

Forward primer: 5'BAtetOF f5'-rest. sites/tetO nts.) (SEQ. ID. NO. 88) 
5 , -TATATAGATCTGGCGCGCGTTTACCACTCCCTATCAGTGATAG-3' 

10 

Reverse primer vrMVnro/SINR (5'-Sindbis ms./CMV nts.) (SEQ. ID. NO. 89) 
5'-TACGCCGTCAAT/ACGGTTCACTAAACGAGCTCTGC-3' 

In PCR reaction #2. the 2289 bp Sindbis 5'-end portion of the sequence is amplified by 
15 standard three-cycle PCR with a three minute extension, from template plasmid 
pKSRSIN-1 (Example 1), using the following oligonucleotide primers that are designed 
to also contain sequences overlapping the CMV promoter nucleotides on one primer. 

Forwnrd primer: CMVSINS'endF f5'-CMV nts./Sindbis nts.) (SEQ. ID. NO. 90) 
20 5'-TAGTGAACCGT/ATTGACGGCGTAGTACACACTATT 

Reverse primer srN2400R fall Sindbis nts.) (SEQ. ID. NO. 91) 
5'-CGTTGAGCATAACCGAATCTAC 

25 Following amplification, the DNA fragments are purified with QIAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 3.5 minute 
extension, using additional 5'BAtetOF and SIN2400R primers. The resulting 
overlapping PCR amplicon of approximately 2660 bp is purified using GENECLEAN 
II, digested with Bgl II, and ligated into plasmid pBGSVdlB/SINl-luc that has also 

30 been digested with Bgl II, treated with alkaline phosphatase, and purified from a 0.7% 
agarose gel using GENECLEAN II. The resulting construct is designated ptetSINl-luc. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



183 

Vector constructs containing other heterologous sequences-of-interest are generated 
using a similar approach, or by direct cloning into the XJjo I and/or Not I sites. 
Subsequently, a selectable E. coli gpt gene (xanthine-guanine phosphoribosyltrans- 
ferase) expression cassette is generated and inserted into the unique Pac 1 site of 
5 plasmid ptetSINl-luc. to provide an additional selectable marker. First, a fragment 
containing the SV40 promoter linked to a gpt gene open reading frame is amplified 
from plasmid pMAM (Clontech, Palo Alto, CA) by standard three-cycle PCR with a 2 
minute extension, and using the following oligonucleotide primers that are designed to 
contain upstream flanking Sac 1 and Pac I sites and a downstream Sac I site. 

10 

Forward primer: SV40proSPF (5'-rest. sites/SV40 promoter seq.) (SEQ. ID. NO. 92) 
S'-ATATAGAGCTCTTAATTAA/TCTTTGTGAAGGAACCTTACTTC 

Reverse primer: 3'ECgptR f5'-rest. site/gpr gene seq.) (SEQ. ID. NO. 93) 
1 5 5'-ATATAGAGCTC/AGGCGTTGAAAAGATTAGCGACCG 

Following amplification, the SV40 promoter/gpf gene DNA fragment is purified with 
QIAquick-spin, digested with Sac I, purified using GENECLEAN II, and ligated into 
piasmid pBGS131 dLV7ioI-BGHTT (Example 5) that also had been digested with Sac I. 

20 treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. Clones with proper orientation of the insert are identified by 
restriction analysis. This configuration positions the promoter and gpt gene 
immediately adjacent to a bovine growth hormone transcription termination signal. The 
resulting gpt expression construct is designated pBGS131 dUTioI-gpt. Next the entire 

25 expression cassette is amplified from plasmid pBGS131 dlA7ioI-gpt by standard three- 
cycle PCR with a 2 minute extension, and using the following oligonucleotide primers 
that are designed to contain flanking Pac I sites. 

Forward primer: SV40proSPF. as shown above (SEQ. ID. NO. 92) 

30 
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Reverse primer: BGHTTpacR (S'-rest. site/BGH seq.) (SEQ. ID. NO. 94) 
S'-TATATATTAATTAA/ATAGAATGACACCTACTCAGACAATGCGATGC 

5 Following amplification, the gpt gene expression cassette fragment is purified with 
QIAquick-spin, digested with Pac I, purified using GENECLEAN II. and ligated into 
the tet-inducible alphavirus vector construct ptetSINl-luc that also had been digested 
with Pac I. treated with alkaline phosphatase, and purified from a 0.7% agarose gel 
using GENECLEAN II. The resulting construct is designated ptetSINlgpt-luc. 

10 For construction of an initial tetracycline-inducible alphavirus vector 

producer cell line, the ptetSrNlgpt-luc construct and a tetracycline repressor/VP16 
transactivator (rTA) expression cassette are stably transformed into the desired 
alphavirus PCL. For example, alphavirus C/GLYCO PCL cells (from above) are stably 
transformed with plasmid pTet-tTAk (see above) by cotransfection with another 

1 5 plasmid encoding a selectable marker. Plasmids pTet-tTAk and pSV2-His, encoding a 
histidinol dehydrogenase marker (Schatz et al., 1989, Cell 5P: 1 035- 1 04S), are co- 
transfected into C/GLYCO PCL cells (or other PCL) at a molar ratio of 40:1, 
respectively, using Lipofectamine, as described by the manufacturer. Approximately 24 
hours post-transfection, the cells are trypsinized and re-plated in media containing 

20 histidinol and 0.5 ug/'ml tetracycline. The media is exchanged periodically with fresh 
drue-containing media, and foci of resistant celis are allowed to grow. Cells are 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones are grown are expanded for screening. Positive pTet-tTAk- 
containing packaging cell clones, designated C/GLYCO/TAk cells, are identified by 

25 transfecting the luciferase reporter plasmid pUHC13-3 (Gossen and Bujard, ibid), under 
the control of a tetO/promoter. in both the presence or absence of tetracycline. In the 
absence of tetracycline, positive C/GLYCO/TAk PCL cells will provide induction from 
the tetO/promoter and inducible, high levels of luciferase. 

Subsequently, the DNA-based alphavirus vector construct ptetSINlgpt- 

30 luc is stably transfected into the C/GLYCO/TAk cells using Lipofectamine, as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells are 



SUBSTITUTE SHEET (RULE 26) 



WO 99/18226 



PCT/US98/21062 



185 

trypsinized and re-plated in selection media, optimized for the particular cell type 
(DMEM + 10% dialyzed fetal calf serum; 250 ug/ml xanthine; 15 ug/ml hypoxanthine; 
10 ug/ml thymidine: 2 ug/ml aminopterin; 25 ug/ml mycophenolic acid), and containing 
0.5 ug/ml tetracycline. The media is exchanged periodically with fresh selection media, 
5 and foci of resistant cells are allowed to grow. Cells are trypsinized and cloned by 
limiting dilution in 96 well tissue culture dishes, and individual cell clones are grown 
are expanded for screening. Positive producer cell lines, stably transformed with 
ptetSINlgpt-luc, are identified by removing tetracycline from the media for at least 24 
hr and testing for luciferase in cell lysates and also testing for packaged luciferase 
!0 vector in the culture supernatants. as described previously. 

B. Alphavims DNA Vectors With Two Level Regulation 

In preferred embodiments, it may be desirable to construct a DNA-based 
alphavims vector (wild-type or with the desired phenotype of reduced, delayed or no 

15 inhibition of host macromolecular synthesis), wherein transcription of the RNA vector 
molecule, capable of autocatalytic amplification, occurs from a promoter which is very 
tightly controlled by two levels of regulation to eliminate all basal levels of 
transcription. Such an approach may combine one inducible component (e.g., the tet 
system from above) with a reversible transcriptional silencing component. For 

20 example, the KRAB repression domain of a certain zinc finger protein may be used. 

Briefly. KRAB (Kxiippel-associated box) domains are highly conserved 
sequences present in the amino-terminal regions of more than one-third of all Kriippel- 
class Cys2HiS2 zinc finger proteins. The domains contain two predicted amphipathic 
a-helicies and have been shown to function as DNA binding-dependent RNA 

25 polymerase II transcriptional repressors (for example, Licht et al.. Nature 346: 16-79, 
1990). Like other transcription factors, the active repression domain and the DNA- 
binding domain are distinct and separable. Therefore, the repression domain can be 
linked as a fusion protein to any sequence specific DNA binding protein for targeting. 
Ideally, the DNA binding protein component can be reversibly prevented from binding 

30 in a regulatable fashion, thus turning "off the transcriptional silencing. For example, 
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within one embodiment the KRAB domain from human Koxi (Thiesen. New Biol. 
2:363-374, 1990) is fused to the DNA-binding lactose (lac) repressor protein, forming a 
hybrid transcriptional silencer with reversible, sequence-specific binding to a lac 
operator sequence engineered immediately adjacent to the tet-responsive promoter 
5 (Figure 30). In this configuration, constitutive expression of the lac repressor/KRAB 
domain fusion (rKR) will result in binding to the lac operator sequence and the 
elimination of any "leaky" basal transcription from the uninduced tet-responsive 
promoter. When vector expression is desired and tetracycline is removed from the 
system, IPTG is added to prevent rKR-mediated transcriptional silencing. 

10 In addition, the KRAB domains from other zinc finger proteins, for 

example, ZNFI33 (Tommerup et al.. Hum. MoL Genet. 2:1571-1575, 1993), ZNF91 
(Bellefroid et al., EMBO J. 72:1363-1374. 1993), ZNF2 (Rosati et al., Nucleic Acids 
Res. 19:5661-5661. 1991), and others, as well as other transferable repressor domains, 
for example, Drosophila en or eve genes (Jaynes and O'Farreil, EMBO J. 10: 1427-1 433, 

15 1991; Han and Manley, Genes Dev. 7:491-503, 1993), human zinc finger protein YY1 
(Shi et al., Cell (57:377-388, 1991), Wilms 1 tumor suppressor protein WT1 (Madden et 
al., Science Z5J: 1550-1 553, 1991), thyroid hormone receptor (Baniahmad et al., EMBO 
J. 1 7:101 5- 1 023, 1992), retinoic acid receptor (Baniahmad et al., ibid), Kid-1 (Witzgall 
et al., Proc. Natl. Acad. Sci. USA 97:4514-4518, 1994), are readily used in such a 

20 system. Furthermore, the lac repressor/lac operator component of this system may be 
substituted by any number of other regulatable systems derived from other sources, for 
example, the tryptophan and maltose operons, GAL4, etc. 

Specifically, an expression cassette that contains the lac repressor (lacf) 
protein fused to the ICRAB domain of human Koxl, with a linked nuclear localization 

25 sequence (NLS; Pro-Lys-Lys-Lys-Arg-Lys (SEQ. ID. NO. 100); Kalderon et al., Cell 
JP:499-509. 1984) to more efficiently direct the protein back to the nucleus, is 
constructed by overlapping PCR. In PCR reaction #1, the approximately 1 100 bp lad 
sequence is amplified by standard three-cycle PCR with a 1.5 minute extension, from 
template plasmid p3'SS (Stratagene, La Jolla, CA), using the following oligonucleotide 

30 primers that are designed to also contain a flanking Xlw I site and AUG start codon in 
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good translation initiation context on the upstream primer, and the SV40 large-T- 
antigen nuclear locaiization sequence on the other. 

Forward primer: LacIST (5'-rest. site/AUG + lad sequence) (SEQ. ID. NO. 95) 
5 S'-ATATACTCGAGTAGCA/ATGGTGAAACCAGTAACGTTATAC 



Reverse primer: /.ac/3'NLSR (S'-NLS/lacI sequenced (SEQ. ID. NO. 96) 
S'-GCCCTTTCTCTTCTTTTTTGG/CTGCCCGCTTTCCAGTCGGGAAAC 



10 In PCR reaction #2. the an approximately 400 bp amplicon, comprising the amino- 
terminal 121 residue iCRAB domain of human Koxl is amplified by standard three- 
cycle PCR with a one minute extension, from template plasmid pKoxl (Thiesen, New 
Biol. 2:363-374, 1990), using the following oligonucleotide primers that are designed to 
also contain sequences overlapping NLS and lad on one primer and a Sac I restriction 

1 5 site and stop codon on the other. 

Forward primer: KRAB5'F f5'-NLS-HacI overlap sequence/KRAB sequence) 
(SEQ. ID. NO. 97) 

5'-CCAAAAAAGAAGAGAAAG/GGCGGTGGTGCTTTGTCTCCT 

20 

Reverse pnmer: KRAB3'R (5'-rest. site+stop codon/KRAB sequence) 
(SEQ. ID. NO. 98) 

5*-ATATAGAGCTCTTA/AACTGATGATTTGATTTCAAATGC 



25 Following amplification, the DNA fragments are purified with QIAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 2.5 minute 
extension, using additional LacI5'F and KRAB3'R primers. The resulting overlapping 
PCR amplicon of approximately 1500 bp is purified using GENECLEAN II, digested 
with XJw I and Sac L and ligated into the eukaryotic expression vector plasmid pEUK- 

30 CI (Clontech, Palo Alto, CA) that has also been digested with Xho I and Sac /, treated 
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with alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN 
II. The resulting ladfKRAB expression construct is designated pEUK-rlCR. 

To generate stable PCL iransformants containing the lad/KRAB 
expression cassette, an alphavirus PCL which has already been selected for 
5 transformation with an rTA tet/transactivator fusion protein cassette is used for starting 
material. For example, alphavirus C/GLYCO/TAk PCL cells (from above) are stably 
transformed with plasmid pEUK-rlCR by cotransfection with another plasmid encoding 
a selectable marker. Plasmids pEUK-rKR and pPUR, encoding a puromycin 
acetyltransferase selectable marker (Clontech), are co-transfected into C/GLYCO/TAk 

1 0 PCL cells for other PCL) at a molar ratio of 40; 1 . respectively, using Lipofectamine. as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells are 
trypsinized and re-plated in media containing 5 ug/ml puromycin and 0.5 ug/ml 
tetracycline. The media is exchanged periodically with fresh drug-containing media, 
and foci of resistant cells are allowed to grow. Cells are trypsinized and cloned by 

15 limiting dilution in 96 well tissue culture dishes, and individual ceil clones are grown 
are expanded for screening. Positive pEUK-rKil-containing packaging cell clones, 
designated C/G/TAk/rKR cells, are identified by immunostaining with a polyclonal 
antiserum specific for lac! (Stratagene, La Jolla, CA). 

Next, specific lac operator (lacO) sequences must be inserted into the 

20 desired ptet-based alphavirus vector (see above). For example, vector construct 
ptetSINlgpt-luc is modified to contain multiple copies of lacO by using a synthetic 
oligonucleotide linker. The LacO oligonucleotide is designed to contain a symmetric 
lacO sequence, including the full 22 bp palindromic operator sequence (Simons et al M 
Proc.Natl. Acad. Sci. USA 57:1624-1628, 1984; Sadler et al., Proc. Natl. Acad. Sci. 

25 USA 50:6785-6789, 1983), and flanking Asc I sites when self-annealed into a double- 
stranded molecule. 

LacOsvmA (SEQ. ID. NO. 99) 
5'-CGCGCCGAATTGTGAGCGCTCACAATTCGG 

30 
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The LacOsymA oliao is self-annealed to form a Asc I "sticky-ended" DNA fragment, 
and then ligated into plasmid ptetSINlgpt-luc that has been digested with Asc I, treated 
with alkaline phosphatase, and purified from a 07% gel using GENECLEAN II. 
Clones containing one, two. three, or more tandem copies of the iacO sequence are 
5 identified by sequence analysis, and given the designation pOItetSINlgpt-luc. 
pOIItetSINlgpt-luc. pOIIItetSINlgpt-luc. etc. Individual clones with different IacO 
copy numbers are then transfected as detailed below, and tested for the tightest level of 
transcriptional regulation. 

To generate an alphavirus vector producer cell line, the DNA-based 

10 pOtetSINtgpt-luc vector constructs are stably transfected into C/G/TAk/rKR cells using 
Lipofectamine. as described by the manufacturer. Approximately 24 hr post- 
transfusion, the cells are trypsinized and re-plated in selection media, optimized for the 
particular cell type (DMEM + 10% dialyzed fetal calf serum; 250 ug/ml xanthine; 15 
ug/ml hypoxanthine: 10 ug/ml thymidine; 2 ug/ml aminopterin; 25 ug/ml mycophenolic 

15 acid), and containing 0.5 ug/ml tetracycline. The media is exchanged periodically with 
fresh selection media, and foci of resistant cells are allowed to grow. Cells are 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones are grown are expanded for screening. Positive producer cell 
lines, stably transformed with the pOtetSINlgpt-luc constructs, are identified by the 

20 expression of luciferase (described previously) at least 24 hr after the removal of 
tetracycline from the media and the addition of 20mM IPTG for induction. Luciferase 
activity is determined both on producer cell lysates and also after transfer-of-expression 
experiments using culture supematants. 

Additional levels of control may be incorporated by adding a third, or 

25 even fourth, level of regulation to the promoter responsible for transcription of the 
alphavirus vector molecule. Such extra level or regulation may be incorporated into the 
minimal promoter, and may involve other inducible systems and/or cell differentiation 
control. In each of the above cases, stable transformation may be accomplished as an 
integration into the host ceil chromosome, or as an extrachromosomal episome, using 

30 for example, the EBV episomal-based vector promoter (for non-integrated). 
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EXAMPLE 8 

Methods for the Generation of Alpha virus-Derived Empty 
or Chimeric Viral Particles 

5 

As illustrated in Example 6. individual defective helper (DH) expression 
cassettes can be constructed to contain elements from multiple alphaviruses or their 
variants. Thus, as described in Example 6. split structural gene DH cassettes for the 
expression of the viral glycoproteins can be constructed to contain the capsid and 

0 glycoprotein genes from different alphavirus species. For example, such a heterologous 
alphavirus glycoprotein DH cassette might contain the capsid gene from Ross River 
virus (RRV). and the glycoprotein genes from Sindbis virus. In this configuration, the 
RRV capsid gene serves to enhance the level of translation of the glycoprotein genes. 

The configurations described herein for the heterologous alphavirus 

5 glycoprotein DH cassettes are designed to improve the packaging of vector replicons 
into alphavirus particles, yet diminish the possibility of recombination, resulting in the 
formation of replication competent alphavirus. The heterologous alphavirus 
glycoprotein DH expression cassette is a replacement of the Sindbis virus capsid gene 
in the DH expression cassettes described in Example 6 ("genomic' 1 structural protein 

0 gene PCL), with a heterologous alphavirus capsid gene (e.g. RRV). The second DH 
expression cassette in the split structural gene PCL contains, for example, the Sindbis 
virus capsid gene. Thus, a split structural gene PCL for the generation of recombinant 
alphavirus vector panicles having Sindbis virus structural proteins can be derived, for 
example, with the Sindbis virus glycoprotein genes and capsid genes on individual DH 

5 expression cassettes. 

It has been shown previously that chimeric viruses containing all of the 
genes of RRV, but with the capsid gene from Sindbis virus, or the reciprocal chimeric 
virus, do not assemble into infectious virus particles (Lopez et. al., J. Virol. 68: 1316- 
1323, 1994). The authors concluded in this report that the interaction between the 

0 carboxy terminus of glycoprotein E2 and capsid protein in virus assembly cannot occur 
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between the structural proteins of heterologous alphaviruses. Thus, recombinant 
genomes arising in the split structural gene alphavirus PCLs described in Example 6. 
consisting of the Sindbis virus non-structural protein genes (originating from the vector 
replicon), the RRV capsid gene, and the Sindbis virus glycoprotein genes, should not be 
5 replication competent, resulting in the propagation of virus (replication competent 
Sindbis virus, RCSV). The packaging restriction between heterologous alphavirus 
species permits the construction of DH cassettes comprised of the capsid gene, 
including the translational enhancement element, from one alphavirus, and the 
glycoprotein genes from a different alphavirus. 
0 However, as illustrated in Figure 31, the observation of Lopez et. al. 

(ibid), that assembly cannot occur between the structural proteins of heterologous 
alphaviruses. is incorrect. Indeed, a DH cassette consisting of the RRV capsid gene, 
and the Sindbis virus glycoprotein gene produces infectious virus panicles. Briefly, 
BHK cells were co-electroporated with SINrep/Lac Z replicon (Bredenbeek et. al., J. 
5 Virol.. 67:6439-6446, 1993), and DH-BB (5 T tRNA/SIN) Crrv (Example 6 and Figure 
24; RRV capsid/Sindbis virus glycoproteins) in vitro transcribed RNAs. The 
electroporation and in vitro transcriptions were performed as described in Example 1. 
Following electroporation. the BHK cells were treated with dactinomycin and labeled 
with [ J H]undine, exactly as described in Example 1. At 18 hours post electroporation, 
0 the culture medium was collected, and clarified by centrifugation at 6,000 rpm for 10 
min. The vector particles remaining in the supernatant were pelleted by 
ultracentrifugation, after first layering over a sucrose cushion. RNA was isolated from 
the BHK cells at 18 hours post electroporation, and from the virus pellet, 
electrophoresed on denaturing glyoxal agarose gels, and visualized by autoradiography, 
5 exactly as described in Example 1. The viral RNAs present in BHK cells electoporated 
with SINrep/LacZ and DH-BB Crrv RNAs, and in virus panicles, are shown in Figure 
31 (lane 1, panel A, and lane 1, panel B). RNAs corresponding to the genomic and 
subgenomic replicative species for SINrep/LacZ and DH-BB Crrv RNAs were present 
in both electroporated BHK cells, and the produced virus panicles. The results 
0 demonstrate, in contrast to Lopez et. al. (ibid), the formation of chimeric alphavirus 
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panicles consisting of RRV capsid protein and Sindbis virus glycoproteins. Further, the 
indiscriminate packaging of genomic and subgenomic SINrep/LacZ and DH-BB Crrv 
RNAs in chimeric alphavirus panicles indicates the inability of the Ross River capsid 
protein to recognize specifically the Sindbis virus packaging sequence, which is present 
5 in the nsPl gene of the SINrep/LacZ vector replicon. 

The viral proteins present in BHK cells electoporated with SINrep/LacZ 
and DH-BB Crrv RNAs, at 18 hours post electroporation, and the produced virus 
panicles are given in Figure 32 (lane 1, panel A, and lane 1, panel B). The viral-specific 
structural proteins in electroporated cells and the produced chimeric alphavirus particles 

10 were indistinguishable. That is. Figure 32 demonstrates clearly that virus panicles 
produced from BHK cells electroporated with SINrep/LacZ and DH-BB Crrv RNAs 
contained the RRV capsid and the Sindbis virus glycoproteins El and E2. This result 
provides indisputable evidence that in contrast to Lopez et. aL, (ibid), there is no 
restriction in assembly between heterologous alphavirus capsid and glycoproteins that 

1 5 prevents the formation of chimeric viral panicles. 

Thus, in distinct contrast to the results and discussion of Lopez et. al., the 
amino terminus of the RRV capsid protein is able to bind with the heterologous Sindbis 
virus genome, and form infectious chimeric alphavirus particles. Importantly, the 
previous conclusion that there is a restriction of virus assembly between heterologous 

20 alphavirus capsid proteins and glycoproteins is incorrect. The generation of chimeric 
alphavirus panicles as described here would, then, also result in the formation of RCSV 
in the split structural gene PCLs described above, since a recombinant genome 
consisting of the Sindbis virus non-structural protein genes (originating from the vector 
replicon), the RRV capsid gene, and the Sindbis virus glycoprotein genes, would 

25 generate infectious virus. Alternatively, this lack of restriction of packaging between 
distinct alphavirus structural proteins and vector replicons permits the tropism of vector 
panicles to be modified. For example, Sindbis virus replicons can be packaged with the 
Venezuelan Equine Ecephalitis virus structural proteins, in order to generate a 
lymphotropic recombinant vector particle. 
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The results described in two separate previous investigations have shown 
that ablation, in vitro, of the interaction between the capsid protein and the positive 
RNA-stranded genome of two icosahedral viruses having triangulation numbers (T)=3, 
turnip crinkle virus (TCV), and southern bean mosaic virus (SBMV), resulted in the 
5 disassociation of the virus particles, and the formation of nucleic acid-free T=l panicles 
(Sorger et. al. J. Mol. Biol., 191:639-656, 1986. and Erickson and Rossmann, Virology 
1 16:12S-136. 1982). In the absence of nucleic acid, T=3 particles similar to wild-type 
virus were not formed in vitro. Owen and Kuhn (J. Virol., 70:2757-2763, 1996), 
investigated the packaging properties of Sindbis virus genomes containing deletions in 

10 the capsid. in order to identify the region of the capsid protein that is required for 
dictating specificity of the encapsidation reaction, in vivo. One mutant virus [CD(97- 
106)] which contained a deletion corresponding to residues 97-106 of the capsid, 
encapsidated both genomic and subgenomic RNAs, indicating the domain of the capsid 
protein required for specific recognition of the genomic RNA packaging signal. In yet 

15 another report, the packaging properties of Aura alphavirus were investigated 
(Rumenapf et. al. J. Virol., 69:1741-1746, 1995). In this study, a mechanism for 
alphavirus packaging that involves a capsid protein-encapsidation sequence interaction 
initiation complex was proposed. This mechanism proposed is based on observations 
by the authors, and others (including Owen and Kuhn. ibid), in which 26S and 49S 

20 alphavirus RNAs are packaged into T=l. T=3. T=4, and T=7 virus particles, and that 
empty capsids arising during infection with alphaviruses have not been reported. 

Based on the literature presented above and the discussions contained 
therein, a RRV capsid gene deleted of the region corresponding to the capsid protein 
domain that is required for dictating specificity of the encapsidation reaction, and, in 

25 addition, surrounding basic residues that bind electrostatically with viral RNA, should 
not be able to form stable capsid particles containing viral RNAs. Thus, the alphavirus 
structural proteins expressed from a heterologous alphavirus DH cassette, consisting of 
this deleted RRV capsid gene and the Sindbis virus glycoprotein genes, should not 
assemble into stable chimeric alphavirus particles. Thus, in the split structural gene 

30 PCL discussed above and in Example 6, a recombinant genome consisting of the 
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Sindbis virus non-structural protein genes, the RRV capsid gene (deleted of the region 
corresponding to packaging specificity and the surrounding basic residues), and the 
Sindbis virus glycoprotein genes, could not generate infectious virus. As described in 
Example 6 ; the Sindbis virus capsid protein is expressed from a separate DH expression 
5 cassette; thus, the three Sindbis virus structural proteins are expressed in toto, resulting 
in the production of recombinant vector panicles. 



of the expressed protein that bind to the Sindbis virus packaging sequence (Weiss et. al., 
Nuc. Acids. Res.. 22:780-786, 1994, and Lopez et. al., ibid), including the basic 

10 residues which bind electrostatically with the viral RNA. were deleted in order to 
construct a heterologous alphavirus capsid-glycoprotein DH that provided translational 
enhancement and correct pE2-6K-El polyprotein processing by post-translational 
cleavage, yet could not assemble stable chimeric RRV/Sindbis virus particles. Figure 
33 illustrates the hydrophobicity profiles (Kyte-Dolittle) of the RRV capsid protein, and 

15 the capsid protein expressed from 3 individual RRV capsid gene mutants (CAlrrv, 
CA2rrv, and CA3rrv), in which varying amounts of the capsid gene encoding a lysine- 
rich protein that interacts with the viral packaging sequence RNA, was deleted. The 
lysine-rich basic region of the RRV capsid protein is shown in Figure 33. Further, the 
hydrophobicity profiles demonstrate that this lysine-rich basic region is progressively 

20 eliminated in the 3 individual RRV capsid gene mutants CAlrrv, CA2rrv. and CA3rrv. 
Figure 34 demonstrates the lysine residues eliminated in the expressed RRV capsid 
protein, as a result of the deletions in mutants CAlrrv, CA2rrv, and CA3rrv. The table 
shown below eives the nucleotides deleted in the RRV genome of constructs CAlrrv, 
CA2rrv, and CA3rrv. 



Nucleotides of the RRV capsid gene corresponding to predicted regions 



25 



Construct 

CAlrrv 



Deleted RRV penome nts. 
7841-7891 



CA2rrv 



7796-7891 



CA3rrv 



7760-7891 
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The RRV capsid gene deletions were constructed on the DH-BB Crrv plasmid DNA 
illustrated in Figure 23. and described in Example 6. The indicated RRV capsid gene 
sequences were deleted by PCR, using the pnmers and other cloning steps given in 
Example 6. 

5 Figures 31 and 32, discussed above, illustrate the virus-specific RNAs 

(Figure 31) and proteins (Figure 32) synthesized in BHK cells electroporated with 
SINrep/LacZ and the DH-BB CAlrrv, CA2rrv, or CA3rrv RNAs, and present in viral 
particles contained in the culture fluids of these cells. The genomic and subgenomic 
species were detected for both the SINrep/LacZ replicon and all the three DH-BB 

10 CAlrrv. CA2rrv, or CA3rrv DH RNAs in electroporated cells (Figure 31, panel A). 
However, the SINrep/lacZ replicon was not packaged in vector^ particles in cells 
electroporated with DH RNA containing deletions in the RRV capsid gene, as 
demonstrated by the absence of replicon genomic RNA in virus panicles (Figure 31, 
panel B). Further, helper genomic and subgenomic RNAs were packaged very 

15 inefficiently and were barely visible in autoradiograms of denaturing gels (Figure 31, 
panel B, lanes 3 and 4), when cells were electroporated with DH molecules containing 
larger deletions of the RRV capsid (CA2rrv or CA3rrv). In contrast, while SINrep/lacZ 
genomic RNA (and DH RNA in electroporations with CA2rrv or CA3rrv) was not 
detected in viral particles from BHK cells electroporated with the DHs containing 

20 deletions in the RRV capsid gene, equivalent RRV capsid protein and Sindbis virus 
glycoprotein levels were observed in virus particles from cells electroporated with all 
DH RNAs, regardless of whether the RRV capsid gene contained deletions (Figure 32, 
panels A and B). This result demonstrates that stable chimeric virus particles not 
containing vector replicon, or other viral-specific RNAs, were formed in BHK cells 

25 electroporated with SINrep/lacZ genomic RNA and DH RNA, from which capsid 
protein unable to bind to the genomic RNA was expressed. The formation of stable 
empty heterologous alphavirus panicles is unexpected and not predicted, based on the 
results and discussions of previous investigations (Lopez et. al., Virol. 65:1316-1323, 
1994, Sorgeret. al. J. Mol Biol. 797:639-656, 1986, Erickson and Rossmann. Virology 

30 //(5:128-136 ; 1982, and Rumenapf et. al. J. Virol, 69: 1741-1746, 1995). 
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To determine the composition of the virus particles produced. BHK cells 
were electroporated with SINrep lacZ and the DH RNAs containing various RRV 
capsid gene configurations, as described herein, and were treated subsequently with 
dactinomycin. and labeled with [ 35 S]methionine and [ 3 H]uridine, as described in 
5 Example 1. The configuration of the viral particles produced were determined by 
ultracentrifugation of clarified cell culture media for 2 hrs at 35,000 rpm in a SW-41 
rotor over a 20%-40% (w/w) sucrose gradient. The results of this study are shown in 
Figures 35-37, and demonstrate again the formation of stable empty heterologous 
alphavirus particles. Figure 35 demonstrates the relative levels of [ 3i S]methionine and 

10 [ 3 H]undine incorporated into panicles synthesized in BHK cells infected at high MOI 
(5) with wild-type virus, Totol 101. Figure 36 demonstrates that the relative levels of 
[ 35 S]methionine and [ 3 H]uridine incorporated into particles synthesized in BHK cells 
electroporated with SINrep/LacZ and DH-BB (5' tRNA) Crrv (Figure 23) RNAs was 
the same as in cells infected with wild-type virus. In contrast, Figure 37 demonstrates 

15 that the panicles produced in BHK cells electroporated with SINrep/LacZ and DH-BB 
(5* tRNA) CA3rrv contained very low levels of incorporated [ J H]uridine. Figure 38 is a 
compilation of Figures 35-37, and illustrates clearly that while the relative levels of 
[ 3S S]methionine and [ J H]uridine incorporated into panicles were similar in BHK cells 
infected or electroporated with RNAs containing wild-type alphavirus capsid genes. 

20 BHK cells electroporated with DH RNA containing deletions of nts. 7760-7891 of the 
-RRV capsid gene formed stable chimeric empty alphavirus particles, devoid of 
SINrep/LacZ RNA. Titers of empty alphavirus particles, produced in cell lines 
electroporated with with SINrep/LacZ RNA and DH-BB CD3rrv in vitro transcribed 
RNAs, and labeled with [ 35 S]methionine, were determined by comparison with BHK 

25 cells, infected with Toto 1101 wild-type virus, and labeled with [35S]methionine. The 
level of radioactivity present in virus-containing sucrose gradient fractions from Toto 
1 101 -infected cells was quanitated. and related to the virus titer present in these same 
fractions, as determined by plaque assay, according to the methods described in 
Example 1 . For empty alphavirus particle titer determinations, the level of radioactivity 

30 present in virus particle-containing sucrose gradient fractions from BHK cells 
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eiectroporated with STNrep/LacZ RNA and DH-BB CD3rrv in vitro transcribed RNAs. 
was quantitated, and related to the [ JS S]methionine/virus titer, from Toto 1101 infected 
cells. The titer of empty chimeric virus particles, containing the deleted Ross River 
virus capsid and the Sindbis virus glycoproteins, produced in SINrep/LacZ RNA and 
5 DH-BB CD3rrv in vitro transcribed RNA eiectroporated cells was 1 x 10 9 panicles/ml. 



While the present invention has been described above both generally and 
in terms of preferred embodiments, it is understood that variations and modifications 
will occur to those skilled in the an in light of the description, supra. Therefore, it is 
10 intended that the appended claims cover all such variations coming within the scope of 
the invention as claimed. 

Additionally, the publications and other materials cited to illuminate the 
background of the invention, and in particular, to provide additional details concerning 
its practice as described in the detailed description and examples, are hereby 
1 5 incorporated by reference in their entirety. 
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Claims 

We claim: 

1. An isolated nucleic acid molecule, comprising an altered alphavirus 
nonstructural protein gene which, when operably incorporated into a recombinant alphavirus 
particle, increases the time required to reach 50% inhibition of host-cell directed 
macromolecular synthesis following expression in mammalian cells, as compared to a wild- 
type alphavirus. 

2. An isolated nucleic acid molecule, comprising an alphavirus 
nonstructural protein gene which, when operably incorporated into a recombinant alphavirus 
panicle, has a reduced level of vector-specific RNA synthesis, as compared to the wild-type, 
and the same or greater level of proteins encoded by RNA transcribed from the viral junction 
region promoter, as compared to a wild-type recombinant alphavirus particle. 

3. An alphavirus vector construct, comprising a 5' promoter which 
initiates synthesis of viral RNA in vitro from cDNA, a 5' sequence which initiates 
transcription of alphavirus RNA, a nucleic acid molecule which operably encodes all four 
alphaviral nonstructural proteins including a nucleic acid molecule according to claims 1 or 2. 
an alphavirus RNA polymerase recognition sequence and a 3' polyadenylate tract. 

4. An alphavirus vector construct, comprising a 5 y promoter which 
initiates synthesis of viral RNA in vitro from cDNA, a 5 1 sequence which initiates 
transcription of alphavirus RNA, a nucleic acid molecule which operably encodes all four 
alphavirus non-structural proteins, an alphavirus viral junction region promoter, an alphavirus 
RNA polymerase recognition sequence, and a 3' poiyadenylate tract, wherein said in vitro 
synthesized RNA, upon packaging into an alphavirus particle and introduction of the particle 
into a mammalian host cell, increases the time required to reach 50% inhibition of host-cell 
directed macromolecular synthesis following expression in mammalian ceils, as compared to 
a wild-type alphavirus particle. 
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5. An alphavirus vector construct, comprising a 5 ; promoter which 
initiates synthesis of viral RNA in vitro from cDNA, a 5' sequence which initiates 
transcription of alphavirus RNA, a nucleic acid molecule which operably encodes all four 
alphavirus non-structural proteins, an alphavirus viral junction region promoter, an alphavirus 
RNA polymerase recognition sequence, and a 3' polyadenylate tract, wherein said in vitro 
synthesized RNA, upon packaging into an alphavirus particle and introduction of the panicle 
into a mammalian host cell, has a reduced level of vector-specific RNA synthesis as 
compared to wild-type alphavirus particle, and the same or greater level of protein encoded 
by RNA transcribed from the viral junction region promoter, as compared to a wild-type 
alphavirus particle. 

6. An alphavirus RNA vector replicon capable of translation in a 
eukaryotic system, comprising a 5' sequence which initiates transcription of alphavirus RNA, 
a nucleic acid molecule which operably encodes all four alphaviral nonstructural proteins, 
including a nucleic acid molecule according to claims 1 or 2, an alphavirus viral junction 
region promoter, an alphavirus RNA polymerase recognition sequence and a 3' polyadenylate 
tract. 

7. An alphavirus RNA vector replicon capable of translation in a eukaryotic 
system, comprising a 5' sequence which initiates transcription of alphavirus RNA, a nucleic 
acid molecule which operably encodes all four alphaviral nonstructural proteins, an 
alphavirus viral junction region promoter, an alphavirus polymerase recognition sequence and 
a 3' polyadenylate tract, wherein said alphavirus RNA, upon packaging into an alphavirus 
particle and introduction of the particle into a mammalian host cell, increases the time 
required to reach 50% inhibition of host-cell directed macromolecular synthesis following 
expression in mammalian cells, as compared to a wild-type alphavirus particle. 

8. An alphavirus RNA vector replicon capable of translation in a 
eukaryotic system, comprising a 5' sequence which initiates transcription of alphavirus RNA, 
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a nucleic acid molecule which operably encodes all four alphaviral nonstructural proteins, an 
alphavirus viral junction region promoter, an alphavirus polymerase recognition sequence and 
a 3 1 polyadenylate tract, wherein said alphavirus RNA, upon packaging into an alphavirus 
particle and introduction of the panicle into a mammalian host ceil, has a reduced level of 
vector-specific RNA synthesis as compared to wild-type alphavirus particle, and the same or 
greater level of protein encoded by RNA transcribed from the viral junction region promoter, 
as compared to a wild-type alphavirus particle. 

9. A pharmaceutical composition, comprising an alphavirus RNA vector 
replicon according to any one of claims 6, 7 or 8 and a pharmaceutically acceptable carrier or 
diluent. 

10. A recombinant alphavirus particle, comprising one or more alphavirus 
structural proteins, a lipid envelope, and an RNA vector replicon according to any one of 
claims 6. 7 or S. 

11. The recombinant alphavirus particle according to claim 10 wherein 
said alphavirus structural protein and lipid envelope are derived from different alphavirus 
species. 

12. A pharmaceutical composition, comprising a recombinant alphavirus 
particle according to claim 10 or 11 and a pharmaceutically acceptable carrier or diluent. 

13. A host cell infected with a recombinant alphavirus particle according to 

claim 1 0 or 1 1. 

14. A togavirus capsid panicle which contains substantially no genomic or 
RNA Vector Replicon nucleic acids. 
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15. The capsid panicle according ro claim 14 further comprising a lipid 
envelope containing one or more alphavirus glycoproteins. 

16. The capsid particle according to claim 14, further comprising an 
alphavirus envelope. 

17. The capsid particle according to claim 14 wherein said capsid is 
derived from a togavirus selected from the group consisting of alphaviruses, rubiviruses, 
fiaviviruses and pesiiviruses. 

IS. A pharmaceutical composition, comprising a capsid particle according 
to any one of claims 1 4 to 17, and a pharmaceutically acceptable carrier or diluent. 

19. An alphavirus structural protein expression cassette, comprising a 5' 
promoter which initiates synthesis of RNA from DNA, a nucleic acid molecule which 
encodes one or more functional alphavirus structural proteins, a selectable marker operably 
linked to transcription of the expression cassette, and a 3* sequence which controls 
transcription termination. 

20. Art alphavirus packaging cell line, comprising a cell containing an 
alphavirus structural protein expression cassette according to claim 19. 

21. An alphavirus producer cell line, comprising a cell containing a stably 
transformed alphavirus structural protein expression cassette, and a vector selected from the 
group consisting of an RNA vector replicon according to any one of claims 6 to 8 } an 
alphavirus vector construct according to any one of claims 3 to 5, and a eukaryotic layered 
vector initiation system according to any one of claims 22 to 24. 

22. A eukaryotic layered vector initiation system, comprising a 5' promoter 
capable of initiating in vivo the 5' synthesis of alphavirus RNA from cDNA, a sequence 
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which initiates transcription of alphavirus RNA following the 5' promoter, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, including a 
nucleic acid molecule according to claims 1 or 2, an alphavirus RNA polymerase recognition 
sequence, and a 3' polyadenylate tract. 

23. A eukaryotic layered vector initiation system, comprising a 5' promoter 
capable of initiating /;/ vivo the 5' synthesis of alphavirus RNA from cDNA, a sequence 
which initiates transcription of alphavirus RNA following the 5' promoter, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, an alphavirus 
RNA polymerase recognition sequence, and a 3' polyadenylate tract, wherein said in vivo 
synthesized RNA, upon packaging into an alphavirus particle and introduction of the particle 
into a mammalian host cell, increases the time required to reach 50% inhibition of host-cell 
directed macrornolecular synthesis following expression in mammalian cells, as compared to 
a wild-type alphavirus panicle. 

24. A eukaryotic layered vector initiation system, comprising a 5' promoter 
capable of initiating in vivo the 5' synthesis of alphavirus RNA from cDNA, a sequence 
which initiates transcription of alphavirus RNA following the 5' promoter, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, an alphavirus 
RNA polymerase recognition sequence, and a 3' polyadenylate tract, wherein said in vivo 
synthesized RNA, upon packaging into an alphavirus particle and introduction of the panicle 
into a mammalian host cell, has a reduced level of vector-specific RNA synthesis as 
compared to wild-type alphavirus particle, and the same or greater level of protein encoded 
by RNA transcribed from the viral junction region promoter, as compared to a wild-type 
alphavirus particle. 

25. A host cell containing a eukaryotic layered vector initiation system 
according to any one of claims 22 to 24. 
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26. A pharmaceutical composidon, comprising a eukaryotic layered vector 
initiation system according to any one of claims 22 to 24 and a pharrnaceutically acceptable 
carrier or diluent. 

27. A method for delivering a selected heterologous sequence to a 
vertebrate or insect, comprising administering to a vertebrate or insect an alphavirus vector 
construct according to any one of claims 3 to 5, an alphavirus RNA vector replicon according 
to any one of claims 6 to 8, a recombinant alphavirus particle according to claim 10, or a 
eukaryotic layered vector initiation system according to any one of claims 22 to 24. 

28. A method for stimulating an immune response within a vertebrate 
comprising administering to a vertebrate an alphavirus vector construct according to any one 
of claims 3 to 5, an alphavirus RNA vector replicon according to any one of claims 6 to 8, a 
recombinant alphavirus particle according to claim 10, or a eukaryotic layered vector 
initiation system according to any one of claims 22 to 24. wherein said alphavirus vector 
construct. RNA vector replicon, particle, or eukaryotic layered vector initiation system 
expresses an antigen which stimulates an immune response within said vertebrate. 

29. A method for inhibiting a pathogenic agent within a vertebrate, 
comprising administering to a vertebrate an alphavirus vector construct according to any one 
of claims 3 to 5, an alphavirus RNA vector replicon according to any one of claims 6 to 8, a 
recombinant alphavirus particle according to claim 10, or a eukaryotic layered vector 
initiation system according to any one of claims 22 to 24, wherein said alphavirus vector 
construct. RNA vector replicon, particle, or eukaryotic layered vector initiation system 
expresses an palliative which is capable of inhibiting a pathogenic agent. 

30. A method of making recombinant alphavirus particles, comprising: 

(a) introducing a vector selected from the group consisting of a eukaryotic 
layered vector initiation system according to any one of claims 22 to 24, an RNA vector 
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replicon according to any one of claims 6 to 8, and a recombinant alphavirus vector panicle 
according to claim 10 ; into a population of packaging cells according to claim 20, under 
conditions and for a time sufficient to permit production of recombinant alphavirus particles; 
and 

(b) harvesting recombinant alphavirus particles. 

31. A method of making a selected protein, comprising: 

(a) introducing a vector which encodes a selected heterologous protein, 
and which is selected from the group consisting of a eukaryotic layered vector initiation 
svstem according to any on of claims 22 to 24, an alphavirus RNA vector replicon according 
to anv one of claims 6 to 8, and a recombinant alphavirus vector particle according to 
claim 10, into a population of packaging cells according to claim 20, under conditions and for 
a time sufficient to permit production or said selected protein; and 

(b) harvesting protein produced by the packaging cells. 

32. A method of making a selected protein, comprising introducing a 
eukaryotic layered vector initiation system according to any one of claims 22 to 24 into a host 
cell, under conditions and for sufficient to permit expression of said selected protein. 

33. A host cell line which contains an alphavirus RNA vector replicon 
according to any one of claims 6 to 8. 
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ATTGACGGCG TAGTACACAC i A77GAA7CA AACAGCCGAC CAATCGCACT ACCA7C-C-A ^0 
7GGAGAAGCC AG7AG7AAAC G7AGACG7AG ACCCCCAGAG TCCG777G7C G7GCAAC7GA 'h 
AAAAAAGC77 CCCGCAA777 GAGG7AG7AG CACAGCAGG7 CAC7CCAAA7 GACCA7GC7A ISO 
A7GCCAGAGC AT777CGCAT C7GGCCAG7A AACTAATCGA GG7GGAGG77 CC7ACCACAG 240 
CGACGA7C77 GGACATAGGC AGCGCACCGG C7CG7AGAA7 G7777CCGAG CACCAGTA7C 300 
AnGfGiC7G CCCCA7GCG, AG7CCAGAAG ACCCGGACCG CA7GA7GAAA TACGCCAG7A 360 
AACTGGlGGA AAAAGCG7GC AAGA77ACAA ACAAGAAC77 GCA7GAGAAG AT7AAGGATC 4£Q 
7CCGGACCG7 AC77GA7ACG CCGGA7GC7G AAACACCA7C GC7C7GCT77 CACAACGA7G 480 
77ACC7GCAA CA7GCG7GCC GAA7A77CCG TCA7GCAGGA CG7G7A7A7C AACGC7CCCG 5*Q 

GAACTA i CTA TCATCAGGC7 A7GAAAGGCG TGCGGACCC7 G7AC7GGA77 GGC77CGACA 6G0 

CCACCCAGiT CA7G77CTCG GC7A7GGGAG G77CG7ACCC TGCG7ACAAC ACCAAC7GGG - 660 

CCGACGAGAA AGiCCTTGAA GCGCG7AACA 7CGGAC777G CAGCACAAAG C7GAGTGAAG 720 

uTAGGACAGu AAAA77G7CG A7AA7GAGGA AGAAGGAG77 GAAGCCCGGG 7CGCGGG777 7go 

Ai MC7CCG7 AGGA7CGACA C777A7CGAG AACACAGAGC CAGC77GCAG AGC7GGCA7C 340 

liCCAiLGui ui7CCAC7:G AA7GGAAAGC AGiCG7ACAC 77GCCGC7G7 GA7ACAG7GG G G0 

'GAG;!^-^ AGGC7ACG7A G7GAAGAAAA TCACCA7CAG 7CCCGGGA7C ACGGGAGAAA 960 

CCG7GGGA7A CGGGG77ACA CACAA7AGCG AGGGC77C77 GC7A7GCAAA "~^C7GAC^ 1GE0 

CAG7AAAAGG AGAACGGG7A TCGTTC::7G 7G7QCACG"A Z^KZZ^:: ACGA7A7GCG iOSO 
A!LAGA;uAi :Gu:AiAA.j GGGACGGA7A "a'CACCGA CGA7GCACAA AAAC77C7GG 

77GGGC~CAA CGAOCGAA" G7CA77.-ACG G7ACGAC T A- CAGGAACACC AACACCA"C I ECO 

AAAA77ACC7 7C7GCCGA7C A7AGCACAAG GG77CAGCAA A7GGGC7AAG GAGCGCAAGG iho 

AiGAil.^A : AACGAGAAA A7GC7GGG7A C7AGAGAAGG GAAGC77ACG 7A~GGC7GC T 13E0 

l^^L :^ CACTAAG -:^G7ACA7- CG7777A7CG CCCACC7GGA ACGCAGACG' 13S0 

vjlG i AAA AG : CC:aGG:7C7 ; ,7AGCGC~7 77CGCA7G7: G7GGG7A7GG ACGACC7C"" 1440 

7GCCGA7G7C GG7GAGGGAG AAA77GAAAC 7GGGA77GCA ACCAAAGAAG GAGGAA-AAC 1300 

-u.ulAG^: CiCGuAGGAA 77AG7CA'GG AGGCCAAGGG 7GC""GAG GA"GC7GAGG i360 

•MjuAAGl.AIj AGlGuAGAAG C7CGGAGAAG CAC T 7CCACG A77AG7GGCA GACAAAGGCA 1620 

.lGAGulAGG CGCAGAAO" G7C7GCGAAG TGGAGGGGC 7 CCAGGCGGAC A7CGGAGGAG 1630 

CAiiAGmuA AACCGCGCGC GG7GACG7AA GGA7AATACG 7CAAGCAAA7 GACCG7A7GA 1740 

TCGGACAG7A 7A7CG7TG7C 7CGCCAAAC7 C7G7AC7GAA GAA7GCCAAA C'CGCACCAG 1800 
CGCACGCGG7 AGCAGATCAG G7TAAGA7CA 7AACACAC T C CGGAAGA7CA GGAAGG7ACG 
CGGiCGAACG A7ACGACGG7 AAAG7AC7GA TGCCAGCAGG AGG7GCCG7A CCA7GGCCAG 

AAi ,CC i AGG AC7GAGiGAG AGCGCCACG7 7 AG7G7ACAA CGAAAGAGAG C77G7GAACC I960 

GlAAAC:A,A CCACA77GCG A7GCA7GGCC CCGCCAAGAA 7ACAGAAGAG GAGCAG7ACA 8040 

AGu.iACAAA GGCAGAGC77 GCAGAAACAG AG7ACG7G7" 7GACG7GGAC AAGAAGCG77 P i GO 

GlGmAAGAA GGAAGAAGGG TCAGuTCTGG TCC7C7CGGG AGAAC7GACC AACCCTCC" 2* 60 

AiCA7GAGC7 AGC7C7GGAG GGAC7GAAGA CCCGACG T GC GG7CCCG7AC AAGG7CGAAA 2EE0 

CAAi AGuAL*i GA7AGGCACA CCGGGG7CGG GCAAG7CAGG 7A77A7CAAG 7GAAC7G7CA 22S0 

CGGCACGAGA 7C77G77AGG AGCGGAAAGA AAGAAAA7" 7CGCGAAA77 GAGGGGGACG 23^0 

iLuAAGACi GAGGGG7A7G CAGA77ACG7 CGAAGACAG7 AGA77CGG77 A7GG7CAACG 34G0 

GAil-lGACAA AGCCG7AGAA GiGCTGTACG 77GACGAAGG G77CGGG7GG CACGCAGGAG 2460 

GAG i AC ! i lG C77GA77GG7 A7CG7CAGGC CCCGCAAGAA GG7AG7AC7A 7GCGGAGACG 2520 
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CCA7GCAA7G CGGAT7CTTC AACA7GA7GG AAC7AAAGG7 ACATTTCAAT "AC"7GAAA 
AAGACATA7G CACCAAGACA 77C7ACAAG7 A7A7C7CCGG GGG77GCACA .^AGTA 

CAGuiHiiui AiCGACACTG CAT7ACGATG GAAAGA7GAA AACCACGAAC "G7GCAAGA "nn 

^GATATT ACAGGGuCCA CAAAGCCGAA GCCAGGGGA7 A7CA7Cc1ga « S 

r/r 'llr-~ ^ uu,Guu " WCAATiGC AAATCGACTA TCCCGGACAT GAAG7AA7GA kcO 

^ uCACAAGGG CTAACCAGAA AAGGAG7G7A 7GCCG7CCGG CAAAAAGTCA 2880 

tlr^rr^ 1 ; AC .1 G,ACGCG ATCACATCAG AGCA7G7GAA CG7G77GCTC ACCCGCAC7G ?9*0 

S^SI ^!^ GGAAA A CC7TGCAGG GCGACCCA7G GA77AAGCAG C7CAC7AACA 3000 

I A ^I AAAG ^ AAAGT lI CAG CCTAC7A7AG AGGAC7GGGA AGC7GAACAC AAGGGAATAA 3060 

II GG I G !: AAT AAA "^r G AC7CCCCG7G CCAA7CCG7T CAGC7GCAAG ACCAACG7H- 317-0 

7>; G ^ GAA AGGAT :^ AA CCGATACTAG CCACGGCCGG 7A7CG7AC77 ACCGG77GCC 3180 

AGTGGAGuuh ACiGiiCllA CAG7T7GCGG ATGACAAACC ACA7TCGGCC A777ACGCC7 "?<0 

TAG«Cu.Ahi i .'GcATiAAG 777TTCGGCA 7GGAC77GAC AAGCGGAC7G 7777C7AAAC rsnn 

££5™ A77CAGCGAG GCCGG7AGC7 CA7"GGACA 2360 

^•---^r-- »- wUt 0 ' Al0uu,ACj A7CACGCCA7 7GCCGCCGAA C7C7CCCG7A WO 

^{■['•--,- CCTGGGAAGG GCACACAAC- "GA777GCAG ACGGGGAGAA 3*60 

LL.AGAGi \h.i t.uutACAG CA7AACC7GG 7CCCGG7GAA CCGCAA7C77 CrCAC"" ~**Q 

lAGrC^GA uTACAAGGAG AAGCAACCCG GCGGGGTCGA AAAA""~G ^"^h " 



36C0 



J I .-l-Uf-UAr; , JbCO 

■J I —'U 



4ECC 



f-I^'!::^ C CC ^A7 T GGC A7ACGCGG7G GACATAAGAA "AGAACG7G GCTTTCGG^ 

^cGl.ac gacctggtgt tca'caaca- -ggaactaaa tacagaaac: 375c 

AC.Al; ■ ;iA GlAC:l^AA gaccatgcgg ggac:~a±a AACGGTCG C2~r^:~ 7s 4 n 

I^cagga 5P C7C '^ tgg"aag" ctatgggtac gcggacggga 290c 

AL^i^LuA ll: i hQ i *jl iCTTGCCA GAAAG7~7G7 :aGG~~G7G~ GG-G"-G^ ^r, 

1-wAiimGi uiAAGu-^ ^CAGAAA-G7 AC:"-i7— "'^C-AC"i ■"-C^'CG-" H G ;r 

G7ACACGGCA A77CAC:::: :ac:a;c~a at"cg7ga7 -tc:"cg7g : a~gaggg : a ceo 

-AAGAGAiiju AG : i GuAGCG uCGCCG7CA7 ACCGCA^G-A aaG"agaA t -~~~r-c.±c- 
G7CAAGAGGA. AGCAG77G73 AACGCAGG3A A7C3GG7GGG tagIc'aG"" GaaIv.ao?" 

^; G ; G " A i C7A7AAAGG7 iGGCCGACCA G7777ACGGA 77CACCCACG GACACAGGCA - C5U 

^f^l ^'C;GTGC C7AGGAAAGA AAG7GA7C3A CGCGG7CGGC CC7GA777CC 4229 

G " A :^G G r AGAAG ^ C;A GCC77GAAA7 7GC7ACAAAA CGCC7ACCA7 GCAG7GGCAG- *3S0 

J:: A ^^ TGAACA7AAC A7CAAG7C7G TCGCCA77CC AC7GC7A7C7 ACAGGCA777 *UQ 

ACuLnGttGu AAAAGACCGC C77GAAG7A7 CAC77AAC7G C77GACAAC" GC'C'AGAC-i ^ C G0 
GAAC7GACGC GGACG7AACC A7C7A77GCC 7GGA7AAGAA G7GGAACGAA AGAA7CGAT1 

C^cACiCCA AC .- 1 AAGuAvj 7C7G7AACAG AGC7GAAGGA 7GAAGA7A7G GACA7CGACG *20 

niG«u^Gi AiGuATCCAT CCAGACAG77 GC77GAAGGG AAGAAAGGGA 77CAG7AC7A 4680 

C>>«Ahju*AA A I iG7A77CG 7AC77CGAAG GCACGAAA7" ■~G- Tr --GCA "C-AAAG-C- i'iQ 

iGuC^AbA; AAAGG7CC7G 77CCC7AA7G ACCAGGAAAG 7AA7GAACAA C7G7G7GCC" 4SC0 

frtlt'^ 'GACACGA'G GAAGCAA7CC GCGAAAAG7G CCCGG7CGAC CA7AACCCG7 4S6G 

i^;itr^- Gl ; uAAAAl - 1 'GCCG7GCC 777GCA7G7A 7GCGA.7GACG CGAGAAAGGG 45c0 

iCvACAGACi iAGAAGlAA7 AACG7CAAAG AAG7"ACAG7 47^--".- i C "'' r '"-- 4=80 

CiAAGCACAA AA77AAGAA" G77CAGAAGG 77CAG7GCAC GAAAG7AG7C C7G777AA7C 50*0 
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GG i AGAG I!:: CGCATTC; jiT CCCGCCGG7A AGTACATAGA AGTGCCAGAA C.AGCCTACCG 

^iluCuul ACAGGCCGAG GAGGGCCCCG AAG7TG7AGC GACACCGTCA CCAiC'ACAG ^.n 

CTCGCTTGAT GTCACAGACA TC7CAC7GGA TATGGATGAC AG7AGCGAAG $0 

t^r-\r- i!™;:' ? GCGGATCGG ACAACTCTAT TACTAGTATG GACAGTTGGT 52S0 

^^ AC ; J™«i$ GAGATAGTAG ACCGAAGGCA GGTGGTGGTG GC7GACG77C 40 

AiGl.uiClm AGAGu.;Gl~ CuA, iCCAC CGCCAAGGGT AAAGAAGA7G mCCCGCCTGG S4nn 

CAGCGGCAAG AAAAGAGCGC ACTCCACC GG CAAGCAA7AG CTC7GAG7CC C7CCAD 7C7 lZ 

r^™ ISS-S ISSS 7 CAATTTTCGA CGGAGAGAGG GCCCGCCAGG 5520 

rrfArr-Arl S-frlK AGAGuCCCCA CGGA7G7GCC 7A7G7C777C GGA7CG7777 5580 

CCGACGuAGA GAiTGA.GAG C i GAGCCGCA GAG7AAC7GA GTCCGAACCC G7CC7G777G 5640 

G7GAAC7CAA T7A7A7CG7C CCGATCAGCC G7ATC77T7C " 5700 

i{ Z 't^ kAGAGACa. AGACGCAGGA GCAGGAGGAC TGAATACTGA C7AACCGGGG 5760 

!^'&JuiA CAiAi i i7CG ACGGACACAG GCCC7GGGCA C77GCAAAAG AAG7CCG77C '820 
ibCAGAAC.A Gl i i ACAGAA CCGACC77GG AGCACAA7G7 CC7GGAAAGA A" CiTGC r- 
C^,l-l;lGA CACG7CGAAA GAGGAACAAC 7CAAAC7CAG G7AC:AGA7G A7GC:CACC : - 
AAGluAACAA AAoiAGuiAL CAG7C7CGTA AAG7AGAAAA TCAGAAAGCC ^AC"AC T G 
AGlGAC:AC. G7CAGGACA CGAC7G7A7A AC7C7GCCAC AGA7CAGCCA 'AA7GC'ata 

AGA.lAC.A iCCGAAACGA 77G7ACTCCA G7AGCG7ACG GGCGAACTAC 7CG"A"i""Ar _., a 

G 1;^: G : «^7C7.:t aacaacta;: :gca::agaa c-a-gaca "ag:a : :7- siic 

r™:: uA '- : : AC *ga7a7gg7 agacgggaca g7cgc:-gc: 6£^o 

I^^!:;^ L ~ ! ,L -'- ! "^'-7AAGC T7AGAAG77A CC:G.-AAAAA CA7GAC-A7A 5-00 
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JL : u ^ Hr i h i llul.AG" GCGGT'CCA" CAGCGA7GCA GA^C.-CGC'A CA^A-~ : ^~G r 0^6 
iLAi.G^L AAC7AAA4GA AA7TGCAACG "CACGCAGA 7 GCG7GAACG CCAAC^l'-' 

r-L : L.-iUL^l .-1 i ; Lnn : l: -jAA i L'L i ' ■ i :-AAAA7A"V ±~"' % T^ r "AfAT- /,cn 

rr _, . - ~r-n.„L, " hMr ^ .^lUrc ^IJ^: o4gU 

ruu^u 1 . luu iC^uAf-Gu^ Ai , AUGA77A CCAC7GAG" 7 -7C-C rr ^^ ^TG'^GG^ ^<in 

—u-r-v: ,;u„ i Hiuuh lA 1 ACA7GAAAAG AGACG7GAAA G"AC^C2-G 6660 

GtACGAAACA CACAGAAGA- AGACCGAAAG 7ACAAG7GA7 ACAAGCCGCA GAAC::C7GG 6720 

G ^i^;; A ;:' AiGL ^ u A-TCACCGGG AA77AG7l:G TAGGCTTACG GCCGTCTTGC S780 

!;^ AAAGA : I?ACACul77 77TGACA7G7 CGGCGGAGGA 7777GA7GCA A7CA7AGCAG 63*0 

™™ G ":^ L ' AC CCGG7AC7GG AGACGGA7A7 CGCA7CA77C GACAAAAGCG. 6900 

A ™ AGG 7, I A : Gb, ;;: iA ACCGG7C7GA 7GA7C77GGA GGACG7GGG7 G7GGA7CAAC 6560 

ChCihCiCuA uiiuA7uAG TGCGCC777G GAGAAA7A7C ATCCACCCA7 C7ACC7ACGG 7020 

biACiCuiM i AAAi iCGGG GCGA7GA7GA AA7CCGGAA" G77CC7CACA C77"77G7C- "080 

™ G .: , 1: ^AiUiCG77 A7CGCCAGCA GAG7AC7AGA AGAGCGGC77 AAAACG7CCA 7^0 

GA i u i ulAGl ui.lAi.uGC GACGACAACA TCATACA7GG AG7AG7A7C7 GACAAAGAAA 7c*Q 

l-t^r^ G '^ GtCAC: 7GGC7CAACA 7GGAGG77AA GA7CA7CGAC GCAG7CA7CG 7E60 

G ^ AG j: G : G i: : G ;;; AC7TC "GCGGCGGA7 --ATC7TGCA AGA7TCGGT7 AC77CCACAG 7220 

GG ; ^gaaaaggc 7G77taag7- ggg7aaac:g :"::agc:g 7 -so 

hC-^CohGlA AGAtuAAbAu AGAAGACGCG C-C7GC7AGA TGAA^C-AAG ^""""i 7"G 

GAi t:; AG ^ A : AACAGGCAC- 77AGCAG7GG CCG7GACGAC CCGG7A7GAG G7AGACAA7A 75C0 

i^C.-.CuiG! UiACibcCA 77GAGAAC77 77GCCCAGAG CAAAAGAGCA 77CCAAGCCA 7560 
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SSSa c A tI^ T CTC ™GG7G GTCCTAAATA GTCAGCATAG TTCATTTCAT 76?Q 

S fffiKK S MGATT CTTTAACATG CTC TO 

SSSrr rrrrl^ GGCC ^GGAG AAGGAGGCAG GCGGCCCCGA 7740 

TAGTCATTrr EE?** TCCAGCAACT GACCACAGCC 6TCAGT6CCC 7800 

JcSSSffi SSSS KE SS CCCCATGTCC ACGCCCGCCA CCGC ^AGA 7 0 
aSaGcS SSf KS^P CGAAGAAACC AAAAACGCAG GAGAAGAAGA ' 7920 

S S ACCCGbAA AGAGACAGCG CATGGCACTT aagttggagg ™ 
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ATTGACGGCG 7AG7ACACAC TA7TGAATCA AACAGGCGAC CAATTGCACT AC3A7CACAA 60 

TGGAGAAGCC AG7AG7AAAC GTAGACGTAG ACCCCCAGAG TCCGTTTGTC GTGCAACTGC 1E0 

AAAAAAGCTT CCCGCAATTi GAGG7AG7AG CACAGCAGGT CACTCCAAAT GACCATGCTA 180 

ATGCCAGAGC ATTT iCGCAT CTGGCCAGTA AAC7AATCGA GCTGGAGGT7 CCTACCACAG 2^0 

CGACGATCTT GGACATAGGC AGCGCACCGG CTCGTAGAAT GTTTTCCGAG CACCAG7A7C 300 

ATTGTGTCTG CCCCATGCGT AGiCCAGAAG ACCCGGACCG CATGATGAAA TACGCCAGTA 360 

AACTGGCGGA AAAAGCGTGC AAGATTACAA ACAAGAACTT GCATGAGAAG ATTAAGGATC 4£0 

TCCGGACCGT ACTTGATACG CCGGATGCTG AAACACCATC GCTCTGCTTT CACAACGATG 480 

TTACCTGCAA CATGCGTGCC GAATATTCCG TCATGCAGGA CGTGTATATC AACGCTCCCG 540 

GAACTATCTA TCATCAGGCT ATGAAAGGCG TGCGGACCCT GTACTGGATT GGCTTCGACA 600 

CCACCCAGTT CATGTTCTCG GCTATGGCAG GTTCGTACCC TGCGTACAAC ACCAACTGGG ■ 660 

CCGACGAGAA AGTCCTTGAA GCGCGTAACA TCGGACTTTG CAGCACAAAG CTGAGTGAAG 7E0 

GTAGGACAGG AAAAT iGTCG ATAATGAGGA AGAAGGAGTT GAAGCCCGGG 7CGCGGG777 780 

ATTTCTCCG7 AGGA7CGACA CTTTATCGAG AACACAGAGC CAGCTTGCAG AGCTGGCATC 34C 

77CCA7CGG7 GTTCCACTTG AATGGAAAGC AGTCGTACAC T7GCCGCTGT GA7ACAG7GG 900 

7GAG77GCGA AGGC7ACGTA GTGAAGAAAA 7CACGA7CAG 7CC3GGGA7C ACGGGAGAAA 960 

CCG7GGGA7A CGGGG77ACA CACAA7AGCG AGGGC77C77 GC7A7GCAAA G77AC7GACA 10E0 

CAGTAAAAGG AGAACGGG7A TCGTTC3ZTG 7G7GCACG7A CA7CC:GGCC AC3A7A7GCG 1080 

A7CAGA7GAC 7GG7A7AA7G GCGACGGA7A 7A7CACG7GA CGA7GCACAA AAACT7C7GG 1 MO 

TTGGGC7CAA CCAGCGAA77 G7CA77AACG 37AGGAC7AA CAOGAACACG AACACCA7GC iECQ 

AAAA77ACG7 7C7GCGGA7C A 7 AGO AC A AG GG77CAGCAA A7GGGC7AAG GAOCGCAAGG 1ES0 

ATGA7C77GA 7AACGAGAAA A7GC7GGG7A C7AGAGAACG CAAGC77ACG 7A7GGC7GC7 13EQ 

TGTGGGCGi 7 "CGCAC7AAG AAAG7ACA77 CG77-7A7CG CGGACG'GGA ACGCAGACG7 1380 

GGG7AAAAG7 CZCAGCC7C7 j j TAGGGCT7 77C3CA7G73 G7CGG7A7GG ACGACG7C77 1440 

7GCCCA7G7C GC7GAGGCAG AAA77GAAAC 7GGCA7"GCA ACCAAAGAAC GAGGAAAAAC I 3G0 

7GC7GCAGG: C'CGGAOGA^ 7"AG7CA7GG AGGCGAAGGC : G" t "GAG GA7GC T CAGG 1360 

AGGAAGCCAG AGO GG AG A AG C7CGGAGAAG CAC77CGAGC A77AG7GGCA GACAAAGGCA 16E0 

TCGAGGCAGC C5CAGAAG77 G7C7GCGAAG TGGAGGGGC7 CCAGGCGGAC A7CGGAGCAG 1650 

CA77AG77GA AACCCCGCGC GG7CACG7AA GGA7AA7AC3 7CAAGCAAA7 GACCG7A7GA 1740 

7CGGACAG7A 7A7CG77G7C TCGCCAAAC7 CTG7GC7GAA GAA7GCCAAA C7CGCACCAG 1300 

CGCACCCGC7 AGCAGATCAG G77AAGA7CA 7AACACAC7C CGGAAGATCA GGAAGG7ACG 1860 

CGGTCGAACC ATACGACGC7 AAAG7AC7GA TGCCAGCAGG AGG7GCCGTA CCATGGCXAG 19£G 

AA77CG7AGC AC7GAG7GAG AGCGCCACG7 7AG7G7ACAA CGAAAGAGAG 777G7GAACC 1980 

GCAAAC7ATA CCACATTGCG A7GCA7GGCC CCGCCAAGAA TACAGAAGAG GAGCAG7ACA EO^O 

AGG77ACAAA GGCAGAGC77 GCAGAAACAG AG7ACG7G77 TGACG7GGAC AAGAAGCG77 2 100 

GCG77AAGAA GGAAGAAGCC 7CAGG7C7GG 7CC7C7CGGG AGAAC7GACC AACGC7CCCT E 160 

ATCA7GAGC7 AGC7C7GGAG GGAC7GAAGA CCCGACC7GC GG7CCCG7AC AAGG7CGAAA E2E0 

CAA7AGGAG7 GA7AGGCACA CCGGGG7CGG GCAAG7CAGC 7A77A7CAAG 7CAAC7G T CA E28G 

CGGCACGAGA 7C77G77ACC AGCGGAAAGA AAGAAAA77G 7CGCGAAA77 GAGGCGGACG 23*C 

7GG7AAGAC7 GAGGGG7A7G CAGA77ACG7 CGAAGACAG 7 AGA77CGG77 A7GC7CAACG E4CG 

GA7GCCACAA AGCCG7AGAA G7GC7G7ACG 77GACGAAGC G77CGCG7GG CACGCAGGAG E46G 

CAC7AC77GC 077GA77GC7 A7CG7CAGGC CCCGCAAGAA GG7AG7AC7A 7GCGGAGACC E5E0 
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2700 
2760 
2920 



CCATGCAATG CGGATTCTTC AACATGATGC AAC7AAAGG7 ACAT77CAAT :"-C r CTGAAA ^sn 

AA ACA ATG CACCAAGACA F7C7ACAAG7 A7A7C7CCCG GCGTTGCACA CAGCCAGTTA isl 

Sr^-r! LATiAC5A7G '3AAAGATGAA AACCACGAAC CCjJGCAAGA 

AGAACAmGA AATCurt.An ACAGGGGCCA CAAAGCGGAA GCGAGGGGA7 A7CATCC7GA 

?J?'ll™i r^f'Tr-- AAGCAATTGC AAATCGAC7A TCCCGGACAT GAAG7AATGA 

SS^rr L ! AACCAGAA AAGGAG7G7A 7GCCG7CCGG CAAAAAG7CA ?880 

JrSftf-f- frrr-r cG ATCACATCAG 4GCA7G7GAA CG7G77GCTC ACCCGCACTG 29*0 

AGuACAGuCi AGTGiGGAAA ACCT7GCAGG GCGACCCATG GA77AAGCAG CCCACTAACA Son 

Tier fJ XJEISS 5' TACTATAG AGGACTGGGA AGGTGAACAG AAGG GaJS 
IIS !^ AA 1 AA ACTCG CCG7G CCAA7CCG77 CAGC7GCAAG ACCAACG7I7. 120 

f'^Trr, AG ^:i G " AA CCGATAC7AG CCACGGCCGG 7A7CG7AC77 ACCGGTTGCC 

AG ^ AGGGA A E^ liCCCA CAGTT7GCGG ATGACAAACC ACA77CGGCC A7T7ACGCC7 

!^ A ^I AA ! nuCAnAAG 77777CGGCA TGGAC77GAC AAGCGGAC7G 7777C7AAAC 

AGAG : 1^ S^TAC CA7CCCGCCG A77CAGCGAG GCC3G7AGCT CA77GGGACA 



3130 
3240 
3300 
3360 



348G 
3340 



3900 
3960 
4G20 



ACAGCCCAGG AACCCGCAAG 7A7GGG7ACG A7CACGCCA7 7GCCGCCGAA t:;:Cu37A *£n 

^\r\r-^' 't'^r GCTGGGAAGG GCACACAAC7 7GA777GCAG ACGGGGAGA. ' " " 
CCAWiiAi uCiixACAG CA ; AACC7GG 7CC3GG7GAA CGGCAA7C77 C-7C4C"~ 

tagiC:::ga g7acaaggag aaccaac"- v---,--^ — ■- ",J:\.±;, 

.-.ACnt-AC 1 1 AG i AC ; i u . iji A'CAGAG" .iAAAAAT-'A xr.r-.——- .<•<-,,-.. 1^ 

.-, : ,Gl-. : :lGl l^Cuh ; : Guu A.ACw. j CAGA^AAGAA G'ACAACZ'G " — 31-0 

[; G :i:'i::^ A G ^ AG ^; AC GAC37GG7G" 7CA7CAACA7 "GGAAC'AAA 7ACAGAAACG 3780 

AClAi .:CA utAC^AA GACCA7GCGG 3GAC3"AAA CG ""CGC" -Jr. 

iGAAi ,lC37 7AACC3AGGA GGCAC:37CG 7GG7GAAG T 3 "AT""- ^"ir"" " 

SS GA £*7C«: GC7C77GCCA C-AAAG7 G7 CAGGG""7 GCAGCGAGA 
CAb*:i«iUi L i LAAlxAA ■ ACAGAAA7G7 ±C~~'±-— fi.cr- 
G7ACACGGCA 477CAC:::G CACGA'C'GA ^""-i: ----- 
Lnhbnb.-.cu -.Giiuurtu.. ui.0L.ji LA. ACCGCACGAA i^,"iGii7 i:--."GAC- <'40 

G I CAAGAGGA AGG V G : 7G I G AACGCAGCCA A7CCGC7GGG 7AGACCAGGC GAAGGAG7C7 4200 

^fitfl l^-GACCA G7777AC3GA T7CAGCCACG GAGACAGGCA 4260 

r-s\r~ rr- ^Ir^ C,AGuAAAGA AAG7GA7CCA CGCGG7CGGC CC7GA77TCC 4320 

GlAAGlA C, AGAAGCAGAA GCC77GAAA7 TGC7ACAAAA CGCC7ACCA7 GCAG7GGCAG 4 38 0 

f " ' So ^ A7CAAG7C7G 7CGCCA77CC AC7GC7A7CT ACAGGCAT7T £1 

f A t^ ^ AAGAG r± G :^ AAG " A " cacttaactg c~gacaac: gcstagaca 4300 

rr^ 'r--^ GGACiJlAAC '- A«.<A77GCC 7GGA7AAGAA G7GGAAGGAA AGAATCGACG i=60 

'nr--r- f-'r^- TCTGTAACAG AGCTGAAGGA *»»™G GAGA7CGAC 4sS 

: S-Jl A l G ^;^j. CGAGACAG77 GC77GAAGGG AAGAAAGGGA 77CAG7AC7A 4630 

CpAAnu,AAA AnuiAiiU I AC7TCGAAG GCAC3AAA77 CCA7CAAGCA GCAAAAGACA 4740 

^1^:9 "c::7aa7g accaggaaag taa^gaacaa cigtctgcct 4sco 

AGA ;.;:^ l^- 1 ^ ^l^zz gcgaaaag7g c::gg7cgac cataacgc:-- 4seo 

LuiLihcL.v ulllAAAACj i ilC:G7GC: 7TTGCA7GTA T^C*i*f-i'"' C'iGAiAGi" 4c;n 

r T Sf AG r i! !^ AC : AA7 ^-CAAAG AAG77ACAG7 A^rC^C, AC33CCC7 C 49 

CiAAG,.-LAA AAiiAALAA, CAGAAGG 77CAG7GCAC GAAAG7AG7C C7G777AA" 3G40 
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^^Sr«S ^^IlhrlJ !:^I£ CCSTA AG7ACA7AGA AG7GCCAGAA CAGCCTACCG 5100 

5160 
5£20 
5c SO 

CG7CAGGACC 7AG77CAC7A GAGA7AG7AG ACGGAAGGGA GG7GG7GG7G GC7GACG^7C £aq 
AiGCluiCCA AGAGCC7GCC CC7A77CCAC CGCCAAGGC7 AAAGAAGA7G GCCCGCCTGG ^400 
AAAAGAGCCC ACTCCACCGG CAAGCAA7AG CTCTGAGTCC CTCCACCTC7 5*60 
CTi mGuiGu Gu.AiCCAiG 7CCC7CGGAT CAA7777CGA CGGAGAGACG GCCCGCCAGG 5-20 
CAGLuu i ACA ACCCC7GGCA ACAGGCCCCA CGGA7G7GCC 7A7G7C777C GGA7CG7777 5*80 
CCGACuuAGA GA i 7GATGAG C7GAGCCGCA GAG7AAC7GA G7CGGAACCC G7CC7G777G ■ %AQ 
^CAnTGA ACCGGGCGAA G7GAAC7CAA TTA7A7CG7C CCGA7GAGCC GTA7CT777C 5700 

36£0 

_ ... _ ....wn.ww "£ c n 

w>uHj<. : i.oA CACG7CGAAA GAGGAACAAC 7CAAAC7CAG GTACC-iGA"- WriZ"~ Scln 

fj£"*f* *J™J* :F C7C - A ^C7acaaaa tcagaaagc: atSSctg sccc 
;;t.:: c -::, ac-"gggac acatcagcga gaa-gc7ata sgso 

^LlZ^'t ^u.AC.^A G7AGCG7ACG GGCGAAC7AC "CCGA7CCAC 

-bt^c.b: AGliLiC GT AACAAC"-'.: 7GC-7GAG-- r :±~"'.±ri T-cr^rrT 

'3^. : A ^-- ^a7a:gg- a GA cgggaca gtcc-:ctgcc sa^o 

.uc*..-«.:u. Artc.s.u.ix -.W-OI..AAGC iTAGAAC'A CCGGAAAAAA CA7"4GTATi t-rn 

cagcgatgca gaacac:c:a caaaatgtgc 6360 

; L -::i:!::r ■ H ^1" A :^-: -^caacg tcacgcaga - gcg7gaac" c:aacac7gg s^eo 

AC.l.-GLjAC Ai;CAATG~I GAA7GC7"" r*L±i*~±~~r ^ta^T"^ — - — r\rn 
-bu^u : . :ljl'AAGC3A -TTAGuA'TA CCAC7GAG" "G'C-C""- ~±~~~±Qr'± 





■ -■ - . n . . _ w v. ^„rs . . v_L CCLU 



ft A ^ f.~ T \~ \ ^ , ~ -- ■ ■ ■ - w v.*--... . 

GCACGAAACA CACAGAAGAA -GACCGAAAG 7ACAAG7GA" ACAAGCCG^ n^r^GG ^ 
C77ATGCGGG A77CACCGGG AA77AG7GCG 7AGGC77ACG GCCG7C77GC 

MLUhnLnl iCACAC-jl;: uiGACATGT CGGCGGAGGA 7777GATGCA A7CA7AGCAG 



6760 
68^0 
69C0 
6 76O 
7020 
70SG 



7 ' i P 



^±9:!;: ^ AlG,C -' ATCGCCAGCA GACmACTAGA AGAGCGGCTT AAAACG7CCA 

GAiu.uthGc Gi i C.ATTGGC GACGACAACA TCA7ACA7GG AG7AG7A7C7 GACAAAGAAA 7^0 

iL-ut;uAGAG GiGCGCCAC: 7GGCTCAACA 7GGAGG7"T-A GA7CA7CGAC GCAG7CA.7CG 7ik 

A ?; I|AC "*- = uCGGCGGat T7a7c::gca agattcggtt ac: t ccacag tgeg 

Cui'jLwJL-oi GiicGGA: CCC C'GAAAAGGC 7G777AAG77 GGG7AAACG C7C"AGG"" 7Vti 

ACGACGAGCA AGACGAAGAC AGAAGACGCG C7C7GC7AGA 7GAAACAAAG GCG^GG77"A 7^ 
GAG : A ^: AT ^CACGCAC- 7TAGCAG7GG CCG7GACGAC CCGG7ATGAG G7AGACAA7A 
1 ,hUC -' uI Gl i AC i Gul.-i ::tjAGAAC" "TGCGGAGAG CAAAAGAGCA 77CCAAGCCA 



73C0 
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rrlr^rl I GTACGG7G GTCCTAAATA GTCAGCATAG TACATTTCAT 7620 

I GAG ™ A r C J r A f AA AC SfSS? ATAGAGGATT CTTTAACATG CTCGGCCGCC 7 

"fS^ GGCCGCGGAG AAGGAGGCAG GCGGCCCCGA 7740 

KSJS- GGTTCTCAAA TCCAGCAACT GACCACAGCC GTCAGTGCCC 7800 

ISSUS ?2S2?? T AGACCTCAAC CCCCACGTCC ACGCCCGCCA CCGCGCCAGA 860 

CCACCGMGC CGAAGAAACC AAAAA( ™ GAGAAGAAGA 7 0 

COGACMATT JtSSJS ^ CATGGCACTT AAGTTGGAGG ' 79 ° 



8000 
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A7TGACGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAAT7GCAC7 AC CATC AC A A 60 

TGGAGAAGCC AGTAGTAAAC GTAGACGTAG ACCCCCAGAG TCCGTTTGTC GTGCAACTGC ign 

AAAAAAGCTT CCCGCAA77T GAGGTAGTAG CACA0CAGG7 CACTCCAAA7 GACCATGCTA f SO 

ATGtwAGAGt ATTiTCGCAT CTGGCCAGTA AAC7AA7CGA GC7GGAGG77 CC7ACCACAG P40 

CGACGATCTT GGACA7AGGC AGCGCACCGG C7CGTAGAAT G7777CCGAG CACCAGTATC ^00 

A77G7G7CTG CCCCA7GCG7 AG7CCAGAAG ACCCGGACCG CA7GATGAAA 7ACGCCAG7A ^0 

AACJuuCGGA AAAAGCG7GC AAGA77ACAA ACAAGAAC77 GCA7GAGAAG A77AAGGA7C 4£Q 

7CCGGACCG7 AC77GA7ACG CCGGA7GC7G AAACACGA7C GC7C7GC777 CACAACGA7G 480 

77ACC7GCAA CA7GCG7GCG GAA7A77CCG TGA7GCAGGA CG7G7A7A7C AACGC7CCCG 5*0 

GAAC7A7C7A 7CA7CAGGC7 A7GAAAGGCG 7GCGGACCC7 G7AC7GGA77 GGC77CGACA 600 

CCACClAGi7 CAiG7iC7CG GC7A7GGCAG G77CG7ACCC 7GCG7ACAAC ACCAAC7GGG " 660 

CCGACGAGAA ACT CC77GAA GCGCG7AACA l CGGAC777G CAGCACAAAG C7GAQ7GAAG ' 7E0 

G7AGGACAGG AAAA77G7CG A7AA7GAGGA AGAAGGAG77 GAAGCCCGGG TCGCGGG777 7 8 o 

A i i iCiCCGi AGGA7CGACA C777A7CCAG AACACAGAGC CAGC77GCAG AGC7GGCA7" P40 

mlCAiCGui G77C:AC""G AA7GGAAAGC AG7CG7ACAC 77GC:GC"7 ~.':[:G"" ^CO 

,GAG::UCGA AGGC7ACG7A G7GAAGAAAA 7CACCA7CAG KZZZZi^Z ACGGGAGAAA % 
CCG^GeGATA CGCGG77ACA CACAAiAGCG AGGGC7C" GC7A7GCAAA G7~AC7GACA 



TG7GGGCG77 7CGCAC7AAG AAAG7ACA77 CG7""A"G CCCACC'GGA ACGCAGACG- 1380 



iO£ n 

CAG7AAAAGG AGAACGGG7A TCGTTCCC7G 7G7GGACG7A CA7CGGGGC: ^C:^7A"GC' '060 

AiCAOAm.AC 7GG7A7AATG GCCACGGA7A TA7CAC:'GA CGA7GCACAA AAAC77C7GG l UQ 

muux.CAA CCAGCGAA" G.CA77AACG G7AGGACAA CAGGAACACC AACACCA7GC [? r $ 
AAAA, ;ACC7 7C7GCCGA7C A7AGCACAAG GG77CAGCAA A70GGC~^G "AGCGC.—GG 
A7GA7C77GA 7 A AC GAGA A A A7GC7GGG7A "AGAGAAC3 CAAGC77ACG T A7GGC7GC 



ictu 
!2£0 



„ _ wAGCC - : 7 777AGCGC7: 77CGGA~G*C GTCGGTA'GG ACGACGTC7 

lijuCGArGiC GG7GAGGCAG AAA7M3AAAC 7GGCA77GCA ACCAAAGAAG GAGGAAAAAC 

iGC;GCAGG7 C7CGGAGGAA 77AG.CA7GG -GGCCAAGGC 7GC~""AG GA7GC7CAGG 1*60 

AGGAAGCCAG AGCGGAGAAG C7CCGAGAAG CAC77:::ac: A77AC7GGCA GACAAAGGCA 16E0 

,CGAGuCAGC CGGAGAAG 7 " G7C7GCGAAG "GGAGGGGC7 CCAOGCGGAC A7CGGAGCAG 1680 

LAt iAGiiGA AACCCCGCGC GG7CACG7AA GGA7AA7ACG 7CAAGGAAA7 GACGG7A7GA 1740 

ICGLACAG7A TA7CGTTGiC 7CGCCAAAC7 C7G7GC7GAA GAA7GCCAAA C7CGCACGAG 1800 

CGCACCCGC7 AGCAGA7CAG G77AAGA7CA 7 A AC AC AC 7 C CGGAAGA7CA GG AAGG7AC" i860 

CGciiGAACC A7ACGACGG7 AAAG7AC7GA 7GCCAGCAGG AGG7GCCG7A CCATGGCCAG IScO 

AAiiC^iAGt AC 7G AG. GAG AGCGCCACG7 7AG7G7ACAA CGAAAGAGAG 7TTGTGAACG 1980 
GlAAAC : AiA CCACA.iGC: A7GCA7GGGC CCGCCAAGAA TACAGAAGAG GAGGAG7AC- 

AGu i \ ACAAA GGCAGAGC77 GCAGAAACAG AG7ACG7G77 7GACG7GGAC A AHA AGO G 7"" P t CO 

Glji iAAGAA GGAAGAAGC: 7CAGG7C7GG 7CC7C7CGGG AGAAC7GAC2 AACCC7CCC7 E160 

AilAiGAGCi AGC7C7GGAG GGAC7GAAGA CCCGACC7GC GG7CCGG7AC AAGG7CGAAA 2HH0 

CAAi AGcAGi GAi AGGCACA CGGGGG7CGG GCAAG7CAGC 7A77A7CAAG 7CAAC7G7CA 2rS0 

CuulAC-AGA IC7G77ACG AGCGGAAAGA AAGAAAA77G 7CGCGAAA77 GACGCCGACG £240 

7Gu i A AG AC i GAGuuu.A7G CAGA77ACG7 CGAAGACAG7 AGA77CGG77 A7GC7CAACG 24G0 

GA.Gl.AlAA AGGGG7AGAA G7GC7G7ACG 77GACGAAGC G77CGCG7GG CACGCAGGAG 3460 

CAiiACi.uL l i ;lA7 - GG7 A7CG7CAGGC CGCGCAAGAA GG7AG7A.C7A 7GC3GAGACG £5£9 
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CCA7GCAA7G CGGATTCTTC AACATGATGC AAC7AAAGG7 ACATTTCAAT CACCG7GAAA 25SO 

AAGACATATG CACCAAGACA TTCTACAAGT ATATCTCCCu GGG77GCACA CAGCCAGT7A 2640 

CAGC7A77G7 ATCGACAC'G CATTACGATG GAAAGATGAA AACCACGAAC CCGTGCAAGA 3700 

AGAACA77GA AA7CGATA77 ACAGGGGCCA CAAAGGGGAA GCCAGGGGA7 A7CA7CC7GA 2760 

CA7G777CCG CGGG7GGG77 AAGCAAT7GC' AAATCGACTA 7CCCGGACA7 GAAG7AATGA 2820 

CAGCCGCGGC C7CACAAGGG C7AACCAGAA AAGGAG7G7A TGCCGTCCGG CAAAAAGTCA 2980 

A7GAAAACCC AC7G7ACGCG A7CACA7CAG AGCATG7GAA CG7G77GC7C ACCCGCAC7G 2940 

AGGACAGGC7 AG7G7GGAAA ACC77GCAGG GCGACCCA7G GA77AAGCAG CCCAC7AACA 3000 

7ACC7AAAGG AAAC777CAG GC7AC7A7AG AGGAC7GGGA AGC7GAACAC AAGGGAA7AA 3060 

77GC7GCAA7 AAACAGCCCG AC7CGCCG7G CCAATCCG77 CAGC7GCAAG ACCAACG7T7 3120 
GG7GGGCGAA AGCA7TGGAA CCGAiACTAG CCACGGCCGG 7ATCG7AC77 ACCGG77GCC * ' 3180 

AG7GGAGCGA ACTGTTCCCA CAG777GGGG A7GACAAACG ACA77CGGCC A7T7ACGCC7 3240 

7AGACG7AA7 T7GCAT7AAG TnTTCGGCA TGGAC77GAC AAGCGGAC7G 7777C7AAAC 3300 

AGAGCATCC: ACTAACG7AC CATCCCGCGG A77CAGCGAG GGCGG7AGG7 CA7"GGACA 3360 
ACAGGCCAGG AACCCGGAAG 7A7GGG7ACG ^7CACGG3A7 7GGCGGGGAA G~":GCG7A 

GA777CCGG7 GhCCAGCTA GG7GGGAAGG GCACACAAC7 7GA777GCAG ACGGGGAGAA 3480 

CGAGAG77A7 C7C7GCACAG CA7AACC7GG 7CCCGG7GAA CCGCAA7C77 CC T CACGC:7 35^0 

7AG7CCCCGA G7ACAAGGAG AAGGAACGGG GCGGGG7CAA AAAA7'C*7G AAC3AG77CA 3600 

AACACCAC7C AG7AC7,G:G G7A7CAGA0G AAAAAA-GA AGCTCCCCGT AAOAGAA T GG 366G 

AA7GGA7CGC CCCGA77GGG A7AGGGGG7G CAGA7AAGAA C7ACAACG7G G""CGGG7 3720 

77CCGGCGGA GGCACGG7AG GACG7GG7G7 7CA7CAACA7 "GGAAC7AAA 7ACAGAAACC 3780 

ACCAC777CA GCAG7GCGAA GACCA7GGGG CGACC7"AAA AACGC~"CG CG7~CGGGGG 38^G 

7GAA7iGC:7 7AACCGAGGA GGCACGC7CG 7GG7GAAG7C C7A7GGC7AC GCGGACGGCA 39G0 

ACAG7GAGGA CG7AG7GACG GC7C" ; GGGA GAAAG7"G7 GAGGG~G~G7 GCAGCGAGAG 396G 

CACA77G7G7 C7CAAGCAA" ACAGAAA7G7 ACG7GA777 T CCGACAAC7A GA.CAACAGCG 402G 

G7ACACGGGA A77CAC:::G :aCCA7C"a A77GCG7GA7 "GG7CCG7G 7A"AGGG7A 4CSC 

CAAGAGA7GG AG 7 "GGAGCG GCGCGG7CA7 ACGGCACGAA AAGGGAGAA7 " A7~GG~GAC7" 41*0 

G7CAAGAGGA AGCAG77G73 AAGGGAGGCA A7CGGC7GGG TAGACGAGGC GAAGGAG7G7 42GG 

GCCG i GCCA7 C7A7AAAGG7 7GGGGGACCA G7777ACGGA 77CAGGGACG GAGACAGGCA 426G 

CGGGAAGAA7 GAC7G7G7GC C7AGGAAAGA AAG70A7CGA GGGGG7GGGG CC7GA777CC 4320 

GGAAGCACCC AGAAGCAGAA GGG77GAAA7 7GC7ACAAAA CGCC7ACGA7 GCAG7GGCAG 4280 

AC77AG7AAA 7GAACA7AAC A7CAAG7C7G 7CGCCA77CC AC7GG7A7C7 ACAGGGA777" 4440 

ACGGAGGCGG AAAAGACCGG G77GAAG7A7 CAC77AAC7G C77GACAACC GCGC7AGACA 4=00 

GAAC7GACGC GGACG7AACG A7C7A77GG3 7GGA7AAGAA G7GGAAGGAA AGAA7CGACG 456G 

CGGGAC7CGA AC77AAGGAG TCiG.AACAG AGC7GAAGGA TGAAGA7A7G GAGA7GGAC3 4620 

A7GAG77AG7 A7GGA7CCA7 CCAGACAG77 GC77GAAGGG AAGAAAGGGA 77CAG7AC7A 4680 

CAAAAGGAAA A77G7A77CG 7AC7:CGAAG GCACCAAA77 CCATCAAGCA GCAAAAGACA 4740 

7GGGGGAGA7 AAAGG7CC7G 7 : CCC7AA7G ACGAGGAAAG 7AA7GAACAA G7G7G7GGC7 4800 

ACA7A77GGG 7GAGACGA7G GAAGCAA7CC GGGAAAAG7G CCCGG7CGAC CA7AACGCG7 486G 

CG7C7AGCCC GCCCAAAACG 7;GGGG7GGG 777GCA7G7A 7GCCA7GACG CCAGAAAGGG 4920 

iCGACAGAGi TAGAAGCAA7 AACG7GAAAG AAG77ACAG7 A7GCCG7GG ACGGGGC77G 4960 

C7AAGCACAA AA77AAGAA7 G7iCAGAAGG 77CAG7GCAC GAAAG7AG7C C7G777AA7C 5Q4G 
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CGCACAC7CC CGCAT7CG7T CCCGCCCG7A AG7ACA7AGA AG7GCCAGAA CAGCCTACCG 5100 

CTCGTCCTGC ACAGGCGGAG GAGGCCCCCG AAG77G7AGC GACACGG7CA CCAIC7ACAG 5160 

C7GA7AACAC CTCGCTTGAT G7CACAGACA 7C7CAC7GGA 7ATGGA7GAC AG7AGCGAAG :E~Q 

GG7CAC7777 77CGAGC~ T 7 AGGGGA7CGG ACAAC7C7A7 TAC7AG7A7G GACAG77GG7 5E80 

CG7CAGGACC 7AG77CAC7A GAGA7AG7AG ACCGAAGGCA GG7GG7GG7G GC7GACG77C 5340 

A7GGCG7CCA AGAGCG7GGC CC7A77CCAC CGCCAAGGC7 AAAGAAGA7G GCCCGCG7GG 5400 

CAGGGGCAAG AAAAGAGCCC AC7CCACCGG CAAGCAA7AG C7C7GAG7CC C7CCACC7C7 5460 

C7777GG7GG GG7A7CCA7G 7CCC7CGGA7 CAA7777CGA CGGAGAGACG GCCCGCCAGG 5520 

CAGCGG7ACA ACCCC7GGCA ACAGGCGCCA CGGATG7GCC 7A7G7C777C GGA7CG7777 5580 

CCGACGGAGA GA77GA7GAG C7GAGCCGCA GAG7AAC7GA G7CCGAACGC G7CC7G777G 5640 
GA7CA777GA ACCGGGCGAA G7GAAC7CAA 77A7A7CG7C CCGA7CAGCC G7A7C7777C " 5700 

CAC7ACGCAA GCAGAGACG7 AGACGCAGGA GCAGGAGGAC 7GAA7AC7GA C7AACCGGGG 5760 

7AGG7GGG7A CA7A7777CG ACGGACACAG GCCCTGGGCA C77GCAAAAG AAG7CCG77E 56G0 

7GGAGAACCA GC77ACAGAA C3GACC7TGG AGCGCAA7G7 CC7GGAAAGA A77CA7GCCC 58S0 

CGG7GC7CGA CACG7CGAAA GAGGAACAAC 7CAAAC7CAG G7ACCAGA7G A7GC::aCGG 59<*C 

AAGGGAACAA AAG7AGG7AC CAG7C7CG7A AAG7AGAAAA TCACAAAGCC A7AACCAC7G 6C0G 

AGCGAC7AC7 G7CAGGACA CGAC7G7A7A AC7C7GCC.AC AGA7CAGC3A GAA7GCTA7A 6G60 

AGA7CA.CC7A iCGGAAACGA TTGTAC-.ZCA G7AGCG7ACG GGCGAAC7AC 7C:3A7C:aC 6'EO 

AQTTCGC7G7 AGG7G7C7G7 AACAAC7A7C 7GCA7GAGAA C7A7CCGACA G7AGCA7G7" 6iS0 

A7CAGA77AC T GACGAC~-C GAiGC"-C" "GGA7A7GG7 AGAGGGGACA G'CGCG'GCG 6c 40 

7GGA7AC7GC AACC77CGC CCCGC'AAGC "7AGAAG77A CCCGAAAAAA GA7GAC7A7A 620C 

GAGGGCGGAA 7A.7CCGCAG7 GCGG:~CCA7 CAGCGA7GCA G A AC AC GC 7 A CAAAA7G7GC 6360 

7CA77GGCGC AAC7AAAAGA AA77GGAACG 7CACGCAGA7 GCG7GAAC7G CCAACACTGG 6^c0 

AC7CAGCGAC A77CAA7G7C GAA70C"~C GAAAA7A7GC A7G7AA7GAC GAG7A77GGG 6480 

AGGAG 1 7CGC K^A^QCZ^ A77ACGA"- CCAC7GAG7"* 7G7CACCGCA 7A"7AGC7A 6:^0 

GAC'GAAAGG C:37AAGGC: GCCGCAC-7 "7GCAAAGAC G7A7AA7" G G'GGCA^GC 66GC 

AAGAAG7GGG 7A7GGA7AGA 7TG73A7GG ACA7GAAAAG AGACG7GAAA ]7"ACAC3AG 6660 

GCACGAAACA CACAGAAGAA AGACCGAAAG 7ACAAG70A7 ACAAGGGGCA GAACCCC7GG 67E0 

CGAC7GC77A C77A7GCGGG A77CACGGGG AA77AG7GCG 7A0GC77ACG GCCG7C77GC 6760 

TTCCAAACA; 7CACACGC7: 777GACA7G7 CGGCGGAGGA 7777GA7GCA A7CA7AGCAG 6840 

AACAC77CAA GGAAGGCGAC CCGGi AC7GG AGACGGA7A7 CGCA7CA77C GACAAAAGCG 69C0 

AAGACGACGG 7A7GGCG77A ACCGG7C7GA 7GA7C77GGA GGACG7GGG7 G7GGA7CAAC' 6360 

CAC7AC7CGA C77GA7CGAG 7GCGCG777G GAGAAA7A7C A7CCACGCA7 C7ACC7ACGG 70E0 

G7AC7CG777 7 AAA77CGGG GCGA7GA7GA AA7CCGGAA7 G7 T CG7CACA C7~777G7CA 70S0 

ACACAG7777 GAA7G7CG77 A7CGCCAGCA GAG7AC7AGA AGAGCGGC77 AAAACG7CCA 7 MO 

GA7G7GCAGC G i TCATTGGC GACGACAACA 7CA7ACA7GG AG7AG7A7C7 GACAAAGAAA 7EG0 

7GGG7GAGAG G7GCGCCACG 7GGC7CAACA TGGAGG77AA GA7CA7CGAC GCAG7CA7CG "E6G 

G7GAGAGACC ACC77AC77C 7GCGGCGGA7 77A7C77GCA AGA77CGG7" AC7CCACAG 73E0 

CG : GCCGCG7 GGCGGA : CGG C7GAAAAGGC 7G777AAG77 GGG7AAACC* C7CGCAGCGG 7380 

ACGACGAGGA AGACGAAGAC AGAAGACGGG C7C7GC7AGA 7GAAACAAAG GCG7GG777A 74^0 

GAG7AGG7A7 AACAGGCAC7 T7AGCAG7GG CGG7GACGAC CCGG7A7GAG G7AGACAA7A 75C0 

77ACACC7G7 CZ~±Z~^C± T7GACAAC T 7 "7GGCCAGAG CAAAAGAOCA 7"CAACCCA 756G 
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7CAGAGGGGA AA7AAAGCA7 C7C7ACGG7G G7CC7AAA7A G7CAGCA7AG 7ACA777CA7 76£Q 

C7GAC7AA7A C7ACAACACC ACCACCA7GA A7AGAGGA77 C777AACA7G C7CGGCCGCC 76S0 

Gcccc77ccc G&z:::x:i gcca7G7gga gggcgcggag aaggaggcag gcggcgccga 77^0 

iGCCiGlCCG CAACGuuC7G GG77C7CAAA- 7CCAGCAAC7 GACCACAGCC G7CAG7GCC" 7800 

7AG7CA77GG ACAGGCAAC7 AGACC7CAAC CCCCACG7CC ACKCujCCA CCGCGCCAGA 7860 

AGAAGlAQjC GCCCAAGCAA CCACCGAAGC CGAAGAAACC AAAAACGCAG GAGAAGAAGA 7=20 

AGAAGCAACC 7GCAAAACCC AAACCCGGAA AGAGACAGCG CA7GGCAC77 AAG77GGAGG 7980 

C:GACAGA7t Gi i CGACGTC AAGAACGAGG ACGGAGA7G7 CA7CGGGCAC GCAC7GGCCA 80*0 

7GGAAGGAAA GG7AA7GAAA CC7C7GCACG TGAAAGGAAC CA7CGACCAC CC7G7GC7A7 8100 

CAAAGC7CAA A777ACGAAG 7CG7CAGCA7 ACGACA7GGA G77CGCACAG T7GCCAGTCA 8160 
ACA7GAGAAG 7GAGGCA77C ACC7ACACCA G7GAACACCG CGAAGGA77C 7A7AAC7G0C~ ' 8E~0 

ACCACGGAGC GG7GCAG7A7 AG7GGAGG : A GA777ACCA7 CCC7CGCGGA G7AGGAGGCA 8GS0 

GAGGAGACAG CGG i CGTCC3 A7CA7GGA7A AC7CCGG7CG GG77G7CGCG A7AG7CC7CG 33^0 

L7GGCGC7GA 7GAAGGAACA CGAAC7GCGG 777CGG7CG7 CACC7GGAA7 An'-^GGG^ S^CG 

AGACAA77AA GACGACCGGG GAAGGGACAG AAGAG7GG7C I GO AGO AC G A C7GG7CACGG 3^G 
CAA7G7G777 GC7CGGAAA7 G7GAG""C CA7GCGACGG CGGGCGGACA 7GCA7ACCC 

G ^^:; TC CAGAGt=: — GACA7CG"" AAGAGAACG 7 GAACCA7GA0 GGG'ACGA7A S38C 

CGG7GG7CAA 7GCGA7-7"G CGGiGCGG.-" GG7C7GGCAG AAGCAAAAGA A0CG~GA7" 86*0 

ACGACm - AC CC7GACGAGC CGG'AC":G GCACA7GCG G7AC7GCGAC CA~AC"G7AC 87GC 

ljiul; iCAG u-w.-j: • AAG A.CGAGCAGG 7C7GGGACGA AGCGGACGA" -ACACGA7A- r 37^0 

GCA7ACAGAC T'ZZZZZ^Z ""GA7ACG ACCAAAGCGG AGCAGCAAGC GGAAACAAG7 8820 

ACw'jl j AGA i GiCGC^AO CAGGA7CACA CGG77AAAGA AGGCACGA7G GA:GACA7GA 388C 

AGA77AGGAG C'CAGGACCG "G7AGAAGGC 77ACC7ACAA -0GA7AC"7 C'GG'GGGAA 89*0 

AA7GGGG7CG AOGGGACAOC G7AACGG~~A GCA7AG7GAG 7AGCAAC7CA GCA-GG7CA" 9CCC 

-ACAG7GGC CCGCAAGA7A AAACGAAA-" T CG7GGGACG GGAAAAAT-7 GATAC": 9C6C 

CGG; ; CACGG TAAAAAAA7" CC77GCACAG 7G7ACGAGGG 7C7GAAAGAA ACA-C~GCAG 9:20 

-G7ACA7CAC 7A7GCACA0G GGGAGACGGG ACGC77A7^C A7GG7ACC" GAAG-A7CA" 9I5C 

GAGGGAAAG7 7 1 ACGCAAAG CCGCCA7C~G GG A AG A AC A" "ACG7A7GAG 7GCAAG7GCG 92-C 

GCGACTACAA GACGGGAACG G7TGGACGG GCACCGAA-7 GAC7GG77GC AC GGCGA7C A 92GG 

AGCAG7GCG7 CGCC7A7AAG AGCGACCAAA CGAAG7GGG7 C77CAAC7CA CCGGAC77GA 9360 

7CAGACA7GA CGACCACACG GCGGAAGGGA AA77GCA777 GCG777CAAG 77GA7CCCGA 94c0 

G7ACC7GCA7 GG7CCC7G77 GCCGACGCGG CGAA7G7AA7 ACA7GGC7T AAACACA7C-' 9^80 

GCC7CCAA77 AGA7ACAGAC CAC77GACA7 7GC7CACCAC GAGGAGAC'A GGGGCAAACG 9340 

C^uAACCAAL CACiGAA7GG A7CG7CGGAA AGACGG7CAG AAAC77CACG G7CGACGGAG 96CC 

A.GGCG7GGA A7ACA7A7GG GGAAA7CA" AGCCAG7GAG GG7C7A7GCG CAAGAG7CAG 9660 

CACGAGGAGA CCC7CACGGA 7GGCCACACG AAA7AG7ACA GCA77AC7AC GA7CGCGA7G 9720 

-I^ 7 ™ CA7C77AGCC G7CGCA7CAG C7ACCG7GGC GA7GA7GA77 GGCG7AAC7G 9780 

77GCAG7Gi« A7G7GCC7G7 AAAGCGCGGG G7GAG7GGG7 GACGCCA7AC GCGI'GGCCG 98-0 

CAAACGCGG7 AATCGGAAC7 7CGC7GGCAC 7C77G7GG7G CG77AGG7CG GGCAA7GC7G 99CC 

AAACGmCAC CGAGACGA7G AG77AC77G" GG7CGAACAG 7CAGCCG7" ;"~GGG7CC 9960 

AG : f L i GCA i ACG777GGGG GC777CA7GG 77C7AA7GCG C7GC7GC7GG 7GCGCG7GG 10020 

I. : : i i AGi Gl* i ! 'sJlGGcC GGG7ACCGG CGAAGG7AGA CGCC7ACGAA CA'GGGACGA I0G6C 
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C7G77CGAAA "GTGCCACAG ATACCGTATA 
CGCiCAATTi GGAGATCACT G7CA7G7C37 
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CC777A7G7G GGGAGGAGCG CAA7G77777 
CG7ACG7CGA AT7G7CAGCA GA77GCGCG7 
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777CAGCA7C GT7TACGCCA 77CGA7CATA 
AC7A7GAC77 CCCGGAATAT GGAGCGATGA 
CCTCG77GAC TAGCAAGGA7 C7CA7CGC:A 
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SEQUENCE LISTING 



(1) GENERAL INFCRMATICN: 

(1 ) APPLICW : Cubensky Jr. . lYams W. 
Polo. Jem M. 
Belli . Barbara A. 
Schlesincer. Scrcra 
Dryga. Sergey A. 
Frolov. Ilya 

Cii) TITLE CF INVB/UOJ: REGMIN/WT AL&AVIRLJS-34SED VEuCRS 
WITH REDXED IWI3mCN CF CELilL^ MACT-,MCLEa£flR 
SYNTHESIS 

(iii) NLT€ER CF SEQUENCES : 118 

(iv) CORRESPONDENCE ATXFESS: 

W) ■^CCPtpE: SEED and 3EPPY LI? 

(3) jiKEr;: 6300 Cc-lurbia Center. 701 Fifth Avenue 

(C) CITY: Seattle 

CD) STATE: Washingccn 

(E) COUNTRY: USA " 

(F) ZIP: 981C4-7C92 

Cv) COMPUTE ifcaOfiELE FCFM: 

(A) .VEDILM T^FE: Fleecy disk 
(8) CCmjTEP: IBM PCcaroadble 
CO CPEPATING SYSTEM : PC-DGS/f-S-OGS 

(D) SOFTWARE; Patsntln Release #1.0. Version #1.30 

(vi) GJRRENT APPL ICATTCN GATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 06-0X1-1597 
CO CLASSIFICATION; 

(viii) ATflREf/XxNT INFCPMATICN: 

CA) NAME: ^aszers . Oevid 0. 

(5) REGISTRATION 33.963 

CO REFE?aCE/T£C:<EI WTCER: 930C49.457C4 

Cix) TELECOMN ICATICN ^FCPttATIC;; 
(A) TELEMC (206) 622-4900 
(3) TELEFAX: (206) 552-6031 
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(2) INFCfiMATICN FCR SEQ ID N0:1: 

(i) SEQUENCE WR^CJERISTTCS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) S7WCENE5S: single 

(D) TCPCLOGY: linear 



Cxi) SEQUENCE DECRIFTICN: SEQ ID ND:1: 

ATCTCIAC32 TGJTCCTAAA TAG7 24 

(2) INFORMATION FCR SEO ID NO: 2: 

( i ) SECUECE CH^CTERISTiCS: 
(A) LB^JGTH; 48 case pairs 
(S) TYPE: nucleic acid 
(C) S7RANCEENES5: single 
(0) TCPCLOGY : linear 



Cxi) SEQUENCE DESCRIPTION SEQ ID N0:2: 

GSIH^GCTC TmX&O CAGATPCAT TG^LGCGTA GTACACAC 48 

(2) INFORMATION FCR SEQ ID N0:3: 

(1) j£a'ENC£ CHARAOtRISTICS: 
(A) LENGTH: 15 base pairs 
(S) TYPE: nucleic acid 

(C) STRANGENESS: single 

(D) TCPCLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEO ID NO: 3: 
MTTTUTGCC TU€£ 
C2) INFCRHATICN FCR SEQ ID NO: 4: 

CD SEQUENCE CHWOERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
CO STRANDECNE5S : single 
CD) TCFCLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEO ID N0:4: 
TATGCWG7 TACT&C 
(2) INFC&WICN FCR SEO ID NC:5: 

(i) SEQUENCE C J AR£GIRISTiG: 

CA) LENGTH: IS sase :airs 

CB) TYPE: nucleic acid 
CO STT^NCELNESS: s:r.cie 
(0) TCPCLOGV: li rear " 



(xi) sequence descrifticn : seq id nd:5: 
gutcattac ttcatgtc 
(2) infcgmticn fcr seq id n0:5: 

ci) sequence character isttcs: 

CA) LENGTH: 15 base pairs 
(B) TYPE: nucleic acid 
CO STRANCECNESS: single 
(D) TCFCLCGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
GdJK^TDV CTTTC 
(2) INFCRMATICN FCR SEQ ID NO; 7: 

(1 ) SECUBJCE GWCTERISTICS: 

(A) LENGTH : 19 base pain; 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) 7CF0L0GY: linear 



Cxi ) SEQUENCE OESCRIPTICN: SEQ ID NO:7: 
ATTCCGfLAT TTC^CCGT 
(2) INFCRMATICN FCR SEQ ID NO: 8: 

(ij SEQUENCE CKWGERISiTCS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
(0 SiKANDELNESS : single 
CD) TCPCLOGY : linear 



(xi) SECURE DESCRIPTICN; SEQ ID N0:8: 
TAWTTG^G CTTTG 
(2) INFCRMA7ICN =CR SEQ ID NC:9: 

fi ) SE-liEMCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(8) T/PE: nucleic acid 
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(C) SIWCEENESS: single 
CO) TCPCLCGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

GGCATATGGC ATOGTTG 

C2) INFOWTICN FCR SEQ ID NO: 10: 

(i) SEQUENCE Cm^CTERISnCS : 
CA) LENGTH: 19 base pairs 
(B) TYPE; nucleic acid 
(0 STRANDEDNESS: single 
CD) TCPXOGY: linear 



(xi) SEOJENCE DESCRIPTION: SEQ ID NO:i0: 
CGGCCATGG a^g^gg 
(2) INFCKMATICN FCR SEQ ID NO: 11: 

(D Science char/obistks: 

CA) LENGTH: 58 base pairs 

CB) TYPE: nucleic acid 
(0 STRANDEC3NB3: single 
CD) TCPCLCGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID iSJOill: 
CCCCTCG^GG Gi i i 1 1 1 i i i ) 1 1 1 1 1 1 1 1 1 TTGAM7GTT /VVWOWA 1 1 1 1GI lb 
(2) INFCf^ATICN FCR SEQ ID NO: 12: 
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(i) SEQUENCE QWUOTSTTG: 
CA) LE>JGTH: 48 base pairs 
(8) TYPE: nucleic acid 
(0 STRWEENES5: single 
(D) TCPCLCGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 12: 
TATATGGGCC 0&T7OTGT AT7k£GGCG 
(2) INFCKMATTCN FCR SEO ID NO: 13: 

Ci) SEQUENCE CHAR^OtRISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
(0 SnWJCECNESS: single 
(D) 7CPCLQGV: linear 



Cxi) SEQUENCE DESGIPHCN: SEO ID NO: 13: 

(2) INFOWTICN FCR SEQ ID NO: 14: 

(1) SEQUENCE CWWRISTICS: 

CA) LENGTH: 21 base cams 

CB) TYPE: nucleic acid 
(C) STONCEDNES5: single 
(0) TCPCLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
A I ^OAGCCA CGGCCG3TAT C 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARCTEUSTICS: 
(A) LENGTH: ZL base pains 
(8) TYPE: nucleic acid 
(C) SiRWEDNESS: single 
(0) TOPOLOGY; linear 



(xi) SEQUENCE OESCFJFTTOJ: SEO ID NO: 15: 

tcctcmtcg xzxirc&G c 21 

(2) INFORMATION FOR SEQ 10 N0:16: 

(i) SEQUENCE C-AR^CTFJSTIG : 

(A) LENGTH: 21 -base cairs 

(B) TfFE: nucleic acc 

(C) STRAMKNESS: smale 
(0) TOPOLOGY; linear 



Cxi) SEQUENCE GESCRIP7IGN: SEQ ID N0:16: 

ficcn&G: gcmtucct g 

C2) INFORMATION FOR SEQ ID NC:17: 

(i) SEOJBCE CHAR/OBIS7IG: 

(A) LENGTH: 21 base pairs 

(B) T/PE: nucleic acid 

(C) S73ANCEENESS: single 
CD) TOPOLOGY: linear 
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Cxi) SEQUENCE DESCRIPTION: 5EQ ID NO: 17: 

cctitidxjG q^tccgcca c a 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE (PAf^CTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAN3EDNESS: single 

(D) TCfCLQGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

G7GGC3GATC CCCTG^G G 21 

(2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE OiARACTHISTICS: 
CA) LEj^JGTH: 20 base pairs 

(B) TYPE: nucleic ace 

(C) STR£NCEENE55: single 
(0) TOPOLOGY : linear " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
TGGGCCGTUT GGTCGTCATC 20 
(2) INFORMATION FOR SEO ID NO:20: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STOWEDNE53: single 

(D) TCFCLQGY : linear 
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(xi) SECURE DESCRIFTICN: SEQ ID N0:2Q: 
TG3GTUTTO\ ACTU£CGGA C 
(2) INFL^TIGN FCR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE; nucleic acid 
CO STRWEENESS: single 
(D) 7CPCL0GY; linear 



(xi) SEXe.CE CECRIPTICN: SEQ ID .NC:21: 
CAATTCG^Cj taccvCXac tc 

(2) INFCKMATICN FCR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
CO S7RANCECNESS: single 
CD) TCPCLOGY: linear 



Cxi) SECUEMCE CESCRIPTICN: SEQ ID N0:22: 
Gf2GtG=GECG TAGJTCGAAT 1G 
C2) INFCft M ATICN FCR SEQ ID NG:23: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
(0 STRANGENESS : single 
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(D) TUOLCGY: linear 



Cxi) SEQUENCE CESCRIPHCN: SEQ ID N0:23: 
TATATOAG Alilliiill ilillitiil TTTTTTT7TT TTTTTTWA TG 52 
(2) INFLATION FOR SEQ ID NO: 24: 

Ci) SEQUENCE OWCTERISTIG: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acici 
(0 STl^NDECMESS: single 
(D) TCPCLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATATATCTAG >£CCA 7G33C GGC*G 
(2) INFCfiWICN FCR SEQ ID NO: 25: 

Ci) SEQUENCE C-ARCTaiSTICS: 

(A) LENGTH: 35 base pairs 

(B) T7FE; nucleic acid 

(C) SIWIDECNESS: single 
CD) 7CPCL0GY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ATA7AGGA7C CC7G17A7AC AGGSCSTAC* OTTC 
(2) INFWATICN FCR SEQ ID N0:26: 

Ci) SEQUENCE CHARACTERISTICS : 
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(A) LENGiTU: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STOM3ECME5S: single 
(D) 7CPCLOGY: linear 



Cxi) SEQUENCE DESCRIPTION SEQ ID NO:26: 
ATATrOSGA G^CCATI^ATT GVTOTC GMTG 35 
(2) INFORMATION FOR SEQ ID NC:27: 

(i) SEQUENCE C^RCTtRISTIG: 

(A) LENGTH; 35 base pairs 

(B) TTFE: nucleic acid 
(C; STRANGENESS: sircie 
CD) TCPCLQGY : linear 



(xi ) SESUENCE DESCRIPTION: SEO ID NC:27: 
TATATAGC3G CCCw&iA GACTCGTCA ,^GVG 35 
C2) INFORMATION FCR SEO ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 21 base pairs 

CB) T{?£: nucleic acid 
CO STRANDEENESS: sirale 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIFTICN: SEQ ID N0:2S: 
TGTPGATGS TG^CGOTGTC 6 
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(2) INFORMATION FCR SEO ID NO: 29; 

(i) SEQUENCE CHARACTERISTIG : 
(A) LENGTH: 22 base pairs 
CB) TYPE: nucleic acid 
fC) STR/VCEDNESS: single 
(D) TOPOLOGY; linear 



(xi) SECUENCE DESCRIPTION: SEQ ID N3:29: 

GVG7GCCAG Ai«£CTCC CG 

(2) INFORMATION FCR SEQ ID N0:30: 

(i> SECt9.CE WRAHFJSTICS: 
CA) LENGTH: 3* base cams 
(S) Tr'FE: nucleic acid 

(C) S7R4CECNE5S: single 

(D) TCPCLCGY: linear 



Cxi) SECUENCE DESCRIPTION: SEO ID N0:20: 
TATATUTuGA G3G7GGIbTT GTACTATTAG TCAG 
(2) INFCRmTION FCR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 41 base pairs 

(B) ™PE: nucleic acid 

(C) SiFANDEuNESS : single 

(D) TOPOLOGY: linear 



(xi) SECUENCE DESCRIPTION: SEQ ID N0:31: 
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TATATATATA THHICGCC GGWjGCCCC MTOTma C 41 

(2) INFCRMATICN FCR SEQ ID ND:32: 

(i) SEOJENCE CHWCIERISnG; 
(A) LENCTH: 65 base pairs 
(8) TYPE: nucleic acid 
CO STTWCEENE55: single 
CD) TOPCLQGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 
CTATAG^GCT (jJTTTiWCT TTTTTTTTiT 1 1 1 i 1 1 1 1 1 1 TTTTTTiTTT 1 1 1 1 1 i i i ib 
AAATG 

(2) INFCfiMATICN FCR SEQ ID N0:33: 

(i ) SEQUENCE C J AR£GE3I^1G: 
CA) LSCTri: 12 base pairs 
(S) TYPE: aclelc acid 
CO STIWICECiNESS: single 
CD) 7CPCLGGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TCGATCCTpG GA 

(2) INFWATIGN FCR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTIC: 

CA) LEMGTTH: 11 base pairs 

CB) TYPE: nucleic acid 
CO S7R4NCECNE3S: single 
(D) TCFCLOGY: linear 



60 
65 
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(xi ) SEQUENCE DESCRIFTICN: SEQ ID N0:34: 
C2) INFDRMOTCN FCR SEO ID NC:35: 

(i) sequence characteristics: 

(A) LBGTVi: 20 base pairs 

(B) TYPE: nucleic acid 
CO SfRNC-EENESS: single 
CD) TCPCLQGY: linear 



Cxi) SEQUENCE CE5CRIPTICN: SEQ ID N0:jc: 
(2) INFORMATION FCR SEQ ID NC:26: 

en sequence (ymjERisncs: 

(A) L£>JGTH; 20 base oairs 

CB) TYPE: nucleic acid 

(C) S7TWCEDNE5S: single 

(D) 7CPGL0GY: linear 



(xi) SEOJENCE CESCRIPTICN: SEQ ID NO: 36: 

GXCTX&T CCTtfCTCCG 

(2) INFCfiMATICN FCR SEQ ID NO: 37: 

CD SECUENCE C J ARCTERIS7IG : 
(A) LENGTH: 44 base pairs 
CS) TYPE: nucleic acid 
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(C) STRWEENESS: single 
CDJ TCPOUOGT^: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
TATATATG^G CTUTMTAM ATI>GGAMT TGCAICGCAT TGTC 
(2) INFORMATION FCR SEQ ID N0:38: 

(i) SEOUENCE G-AR^CTlRISTTG: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEENE5S: sircle 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TA7A7GAATT Oi.TPGAATGA MC7AC7CA G^OATGCGA TOO 

(2) INFORMATION FOR SEO ID NG:39: 

(<) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 56 base pairs 
CB) TYPE: nucleic acid 
(C) STRANCECNESS: single 
CD) TCPCLOGY: linear 



(xi) SEQUENCE CE3CRIFTICN: SEQ ID ,\0:39: 
mwtAAt. GGGTCGGCAT GGCATC7CC4 Cu CC7CGCG GTCCGraG GECATC 
C2) INFORMATION FCR SEQ ID N0:40: 
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(i) SEQUENCE OWCTERISnCS: 

(A) LENGTH; 69 base pairs 

(B) TYPE: nucleic acid 

(C) 51WCEDE5S: single 

(D) TCfOXGY: linear 



(xi) SEQUBJCE DESCRIPTION: SEQ ID N0:40: 
TATATC2GCT COCCCTCG CCATCC&GT G^CGTCGCaT CCTCCnGS A TCCCC^ST 

(2) INFCFMTICN FCR SEQ ID N0:41: 

(i) SECUBiCE CH^ChRISTIG: 
(A) LE>JGTH: 34 case pairs 
CB) TYPE: nucleic acid 
CO S71WCEENESS: single 
CD) TUPCLQGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

TATA7AI2GA TCTTTb^T TOTTATiGA CTPG 

(2) INFCFMATICN FCR SEQ ID NC:42: 

Ci) SEOJENCE WRCTERISTICS: 
CA) LENGTH: 42 base pairs 
(B) TYPE: nucleic acid 
CO STRANCEDNES3: single 
CD) TCPCLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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ujzkmtk: ggixctm pc&gjos otatata^ cc 
(2) infowticn fcr seq id n0:43: 
(i) seobjce characteristics: 

(A) LENGTH: 38 base pairs 
CB) TYPE: nucleic acid 

(C) STKANDEENESS: single 

(D) TCPCtQGY: linear 



Cxi) SEOJENCE OECRIPTICN: SEO ID N0:43: 
GCiUJi7TA3 TkWCGTAI T&CGGCGTA GtflOQC 
(2) INFCR^ATICN FCR SEQ ID N0:44: 

(i ) SEQUENCE CHARCItRISTIG: 

(A) LBGTT-!: 23 -base pairs 

(B) TYPE: nucleic acid 
CC) STRANCECNES5: si rale 
CO) TTPCLCGV: linear 



Cxi) SEQUENCE DESCRIPTION SEQ ID N0:44: 

CTGGWCCG GT&GTXZA TAC 

(2) INFCFMATICN FCR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 base.Dairs 

CB) TYPE: nucleic acid 

CC) STRANG ECNESS; single 
(D) TCPCLCGY: linear 
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(xi) SEOBJCE DESCRIPTION 5EQ ID N0:45: 
QJATO TOCuTGCCG TC 
(2) INFORMATION FCR SEQ ID N0:46: 

CD SECUENCE QWCTERISTICS: 

(A) iimi: 34 base pairs 

(B) TYPE: nucleic acid 

(C) SiRWDEDNESS: single 

(D) TTKLOGY: linear 



(xi) SEEUBCE DESCRIPTION: SEO ID UD:46: 

CCTAlTxGGC CGC3Tu3W CT7TGTGGC7 CC7C 

(2) INFORMATION FCR SEO ID N0:47: 

(1) SEGUBJCE CHARACTERISTICS: 
(A) LEMCm-f: 31 base pairs 
C3) TfFE: nucleic acid 
(0 5TPA\CECNE5S; single 
(0) TCPCLGGY: linear 



(xi) SEOJENCE DESCRIFTICN: SEQ ID .NJC:47: 
CCTATTGGCC JW&CCM TTTA113CCTA C 
(2) IMFCfiMATICN FCR SEO ID N0:4S: 

(i) SEOJQjCE CHARACTERISTICS: 

(A) LBjGTK: 34 base pairs 

(B) TYPE: nucleic acid 
iC) STRACECNE55: single 
(D) TOPOLOGY: linear 
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(xi). SEQUENCE DESCRIPTION SEQ ID N0:48: 
CCTA7GCGSC (ZCl&OS &WCAAT £CG 
(2) INFCRMATICN FCR SEQ ID ND:49: 

Ci) SEOJBJCE OWCTERISTTG: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
(0 ST^NDEENESS: single 
(D) TCrCLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NC:49: 

CCTAT7GGCC -Cv^G^CA TCATCG3GC fiG 

(2) INFWTTOJ FCR SEQ ID NO: 50: 

(i) SEQUENCE C-ARC7ERI5TIG: 
(AJ LENGTH: 39 base pairs 
(B) TYPE: nucleic acid 
CO S7RWDEDNESS: single 
CD) TCfCLOGY: linear 



(xi) SEQUENCE DESCRIPTION SEQ ID N0:50: 

TATATATCCG GWGuU A AaZTAAATAJ AAftAiTTTT 

(2) INFCFMTICN FCR SEQ ID N0:5I: 

Ci) SEQUENCE C-/WTERISnC: 
(A) LENGTH: 43 base pairs 
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(B) TYPE: rucleic acid 
CO STOtfCEENESS: single 
CO) OTOLOGY: linear 



Cxi) SEOJBJCE DESCRIPTION; SEQ ID ND:51: 
TATATAG3AT GTTlaSWUr AWAOCA A£ 

C2) INFORMATION FCR SEQ ID N0:52: 

(T) secub.ce (ymcrs.isri(S: 

(A) LBjGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) S7EANCEDNE53: si rale 

(D) TOPOLOGY: linear " 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TCGATCCiAG GA 

(2) [NFCRMATTCN FCR SEQ ID NO: 53: 

C i ) SEQUENCE OWiCiERISTIG : 
(A) LENGTH: 16 base pairs 
(3) TYPE: nucleic acid 
CO S7RWENE33: single 
CD) TCPCLOGY: linear 



Cxi) SEQUENCE DESCRIPTICN: SEQ ID NO: 53 
GC7CTTAA17 A^GC ' 
(2) rNFCRMTICN FCR SEQ ID NO: 54: 
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(i) SEQUENCE CH^CIERISnCS: 

CA) LENGTH: 55 base pairs 

CB) TYPE: nucleic acid 
CO STOMjEDNESS: single 
(D) TCPGLCGY: linear 



Cxi) SEGLENCE DESCRIPTION: SEQ ID N0:54: 
OiJTTTAAAC AOTC7TATC TCG^GTATGC GGOSTATG MTTCSITTA AACGA 
C2) IMFORMAHCN FCR SEQ ID N0:55: 

(i) SEQUENCE CKW"IS7ICS: 

(A) LENGTH: 55 case pairs 

(B) TYPE: nucleic scid 
CO STRANDENES : single 
CD) TCPCLOG'r': linear 



Cxi) SECLeCE CESCRIPTiCN: SEQ ID N0:£5: 
TCGTTTAAAC GAATTCATAG CGGuSWTA CTG^GATM GA1UTCTTTA AACAG 
(2) INFCRmnCN FCR SEQ ID N0:56: 

(i) SEQUENCE CFARAGERrSTICS: 
CA.) LENGTH; 37 base oairs 
CB) TYPE: nucleic acid 
CO STRANGENESS : single 
CD) TCFCLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:56 
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ATATATCGG3 /GTCCSCCG CTR^IO^ CASCTA 
(2) INFCRMflJTCN FCR SEQ ID N0:57: 

(i) SEQUENCE CHWCTERISTICS: 

CA) LBGTH: 26 base pairs 

CB) TYPE: nucleic acid 
CO S7WJDENESS: single 
(D) TCPCLOGY: linear 



(;<1) SEaBCE CECRIPTTCN: SEQ ID N0:57: 

A7ATAGGA7C aC^GVGW C7CJ7C^GA AGSOa 

(2) INFCFMATICN rCR SEQ ID NC:53: 

(i) SEQUENCE CHARAraiSTIG: 
(A) LE>Oi: ^4 ease pairs 
(3) TYPE: nucleic acid 
(C). STRANGENESS: single 
CO) TCPCLGGY: linear * 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:58: 

TATATATlEtG ClUTTAC'ViA TAWGWA GCA7CXAAA TTTC 

C2) INFCFMHCN FCR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTZSISTICS; 
(A) LE^: 36 case pairs 
(3) TYPE: nucleic acid 
(0 STRANGENESS: single 
(D) TCPCLOGY: linear 
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Cxi) SEQUENCE DECRIFTICN: SEQ ID NQ:59: 
TATA7GWTT GjTTTG^CA MIXJWJ AGAAIG 
(2) INFCRmnCN FCR SEQ ID NO: 60: 

(i ) SEQUENCE aWOERISHC: 

(A) LBG7H; 35 base pairs 

(B) TYPE: nucleic acid 
CO STOCEENESS: single 
(D) TCFCLCGY: linear 



Cxi) SECUBCE DESCRIPTICN: SEQ ID ,VjO:6C: 
TA7A7AGA7C TAG7U7TATE CAA7AC7C77 GTAI7 
(2) INFCFMA7ICN FCR SEC ID NG:6I: 

CD SEQUENCE WAXfiCcilSTlCS: 

(A) LENGIK: ^ base pairs 

(B) TYPE: nucleic acic 
£0 S7PANCENES : siraie 
CD) TCPCLCGY : linear " 



(xi) SEOJENCE GECRimCN: SEQ ID ND:6I: 

GGGATACTCA CCACTAIATC TG3ACGGTAT O^GGT^S GU 

C2) iMFCh^ATIGN FCR SEO ID NO: 62: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 22 base oairs 
CB) TYPE: nucleic acid 
(0 STONCEENE5S: single 
CD) 7CPCLGGY. linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
&TAIAGIG3 T&2GTATCCC CG 
(2) INFCKM4TICN FCR SEQ ID N0:63: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 34 base pa in; 

(B) TYPE: nucleic acid 
(0 STT<ANDECMESS : single 
(D) TCPCLOGY: linear 



(xi) SEQUENCE CECRIPTICN: SEO ID ND:63: 
TA7ATGGA7C CCL^GATCC TTMTOTCT QATG 
(2) INFCWTICM FCR SEQ ID NQ:64: 

{ i ) SECie.CE C-AFACTIRISTICS : 

(A) LENGTH: 2S base pairs 

(B) TYPE: nucleic acid 

(C) 57RANCEDNE5S: single 
CD) TCfCLQGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID ,NJO:& 

TATATATGCA TCCCCCCCCC CCCCAAX 

(2) WFCfiWICN FCR SEQ ID NO; 65: 

(l) SEQUENCE CHARACTERISTIC: 
(A) LE^JGTH: 43 base oairs 
C8) TYPE: nucleic acid 
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(C) 5HWCEENESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIFTIGN: SEQ ID NO:£: 
CA7GGGWC GATCCXATC CUPOAKG 7{£i7TTOAA PGG 43 
(2) INFCRMT7CN FCR SEQ ID NO:66: 

(i) SEQUENCE WRACTERISTICS ; 

CA) LE>JGTH: 25 base pairs 

CB) TYPE: nucleic acid 
CO S7RANCECNESS: single 
CD) TCPCLCGY : linear 



(xi) SEQUENCE CESCRIPTCCN: SEO ID N0:6c: 
G&tGrGGATC G77TGCA7G ATR]A 25 
(2) INFCRMAJICN FCR SEQ ID NO: 57: 

(i) SEQUENCE CHflRCTtRISiTG; 

(A) LE^XTTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNES5: single 
CD) TCPCLCGY: linear 



Cxi) SECUENCZ CESCRIPTICN : SEO ID NO: 67: 
TATATATGCA TTCAGVGA4 CTCGTCAAGA AGGCGA 
(2) INFCfiMATICN FCR SEQ ID NQ:63: 
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Ci) SECUENCE CHARACTERISTICS: 
CA) LENGTH: 35 base pairs 
(8) TYPE: nucleic acid 

(C) STH^CEDNESS: single 

(D) TCPOLCGY: linear 



Cxi) SEDUENCZ C-ESCRIFTTCN; SEQ ID NO:68: 
ATAT^CXGA MCAOXC ATGMTA^G GATVC 
C2) INFORmnaj FCR SEO ID N0:69: 

0) SEQUBJCE GVWCTtRISTICS: 

CA) L3GTH: 42 base pairs 

CB) T'FE: nucleic acid 
(0 STPANCECNESS: single 
CO) 7CPCL0GY: linear 



Cxi) SEOeiCE CESCRIFTICN: SEO ID NC:59: 

TATATCC3GC CCCTAuAC: AOUnUiGT CCC7TCCGGG GT 

(2) INFLATION FCR SEQ ID N0:70: 

Ci) SECUENCE CHARACTERISTICS : 
(A) LENGTH: 42 base pairs 
CB) TYPE: nucleic acid 
CO STOCEENESS: single 
CD) TCPCLCGY: linear 



Cxi) SECUENCE CESCRIPTTCN: SEO ID .NO:70: 
ATATACTCGA GAGCAATbTC CGCAGC^CCA C113GTCAGG CA 
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(2) INFCRMflJTCW FCR SEQ ID NO: 71: 

(i ) SEQUENCE CHAR/C7ERISTICS: 
(A) LENGTH: 39 base pairs 
CB) TYPE: nucleic acid 
CO STRANGENESS: single 
(D) "TOPOLOGY: linear 



Cxi) SEQUENCE CESCRIPTICN: SEQ ID NO:71; 
ATAT^GCGG CCGCXATUT TGiGIsTTA GTC^GCATC 39 
(2) INFCKMA7ICN FCR SEQ [D »\C:72: 

(i) SECUECE C-APACTtR':STIG: 

(A) L3JGTH: 34 c^se pairs 

(B) TYPE: nucleic acid 

(C) SffiflCENESS: single 

(D) TCPCtCGY: linear 



(xi) SEQUENCE DESCRIPTICN: SEQ ID M3:72: 

ACCAT7AAT TAACGA 71SCC PCAA 

(2) INFCWATICN FCR SEQ ID NG:73: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3^ base pairs 
(8) T7FE: nucleic acid 

(C) STKAMCEMESS : sincle 

(D) TCPCLCGY: linear " 
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Cxi) SEQUENCE DECRIPTICN: SEQ ID NO: 73: 
PCKAJTAAT TAfiGTATTGG CC0QV\TQ2G GTUT 
C2) INFCRmnCN F03 SEQ ID ND:74: 

Ci) SEGUENCZ CHflWCTEUSHCS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
CO STRANGENESS: single 
CD) TCPCLQGY: linear 



Cxi) SEOJBCZ CECRIP7ICN: SEQ ID NO: 74; 

CC^A™ -GGTCCTAiAT A^ATSC 

(2) INFCRMATICN FCR SEO ID ND:75: 

Ci) SZZU&CZ CFAR/SGtRISTICS: 
(A) LENGTH: 25 base -airs 
CB) TYPE: nucleic acid 
(C) STRANGENESS: sinale 
CD) ilFCLCGY: linear " 



Cxi) SEQUENCE CE5CRIPTTCN: SEQ ID NO: 75: 
COC¥GCT7 COEGuGtfG CCGCC 
C2) INFCRMATICN FCR SEQ ID NO: 76: 

Ci) SEQUENCE C-ARACTERISTICS: 

(A) LE>X=Tr-:; 23 base pairs 

(B) TYPE: nucleic acid 
CO STRANGENESS: single 
CD) TCPCLOGY: linear 
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Cxi) SECUBJCE DESCRIPTION: SEQ ID N0:76: 

c&gzatcc csmrzoi ICC 

(2) INFCRMATTCN FCR SEQ ID NO:77: 

(l) SEQUENCE CHARACTERISTIC : 

(A) LBO-i: 25 base pairs 

(B) TYPE: nucleic acid 
CO STRANCEENESS: single 
CD) TCPCLOGY: linear 



(xi) SEIUErJCE CE5C3IP7ICN: SEQ ID NO: 77: 

CCC^GCTT GTiSDOaGG ATuE 

(2) INFCf^ATICN FCR SEC ID NO: 78: 

(i) SEC19JCE C-ARAOFJSTICS: 
(A) L£>GTH; 25 base pairs 
(8) TYPE: nucleic acid 
(C) STRAMKNE5S: single 
CD) TCPCLOGY: linear 



(xi) SEOjBJCE DESCRIFTICN: SEQ ID N0.78: 
OXGS\TCC GTaCATGA AJTCC 
(2) INFCPMATCCN FCR SEQ ID i\jC: 79: 

Ci) SECIBCE CHARACTERISTICS: 

(A) LE>JGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STCWENESS: single 
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(0) 7CPCL0GY: linear 



Cxi) SECUBCE DESCRIFTICN: SEQ ID N0:79: 



CC^CWuT CO££TTAC 

(2) INFCRmnCN FCR SEQ ID NO:80: 

Ci) SEQUENCE G^nERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
(0 ST^MCEENESS: single 
(D) 7CFCLCGY : linear 
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(xi) SEQUENCE DE5CTPTICN: SEQ ID N0:8Q: 

UKCH/^G CGTGGCTTT TTCTTC 26 

(2) rNFCRMATICN FCR SEQ ID NO: 81: 

(i) SEQUENCE C-;AR/£raiSTICS: 
(A) LENGTH: 25 base pairs 
CB) TYPE: nucleic acid 
CO SmWEENESS: single 
CD) TCPCLQGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GXCTTA^ X2fiG£A£A GAATC 25 
(2) INFQRjWICN FCR SEQ ID N0:82: 

Ci) SEQUENCE CHWGERISTIG: 
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(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
CO STRWDEENESS: single 
(D) TCPGLOCT: linear 



Cxi) SEQUENCE DESCRIPTION SEQ ID N0:82: 

CCACWCTT (ZXZXZSi P&G 

(2) INFCft M ATICN FCR SEQ ID ,^D:83: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LE>GTH: 19 base pairs 
CB) T/FE: nucleic- acid 
CO STRANCEENE3S: slnqle 
(D) TCFCLCGY: linear " 



Cxi) SEQUENCE CESGIFTICN: SEQ ID N0:83: 
CmCG I GGCG GA i CCCCTG -jq 
(2) INFCFMATICN FCR SEQ ID N0:84: 

(i) SEOJEMCE (>tWTERISiTCS: 

CA) LBUIH: 24 base pairs 

CB) TYPE: nucleic acid 
CO STRANDEENES5: single 
CD) TCfCLCGY: linear 



(xi) SEQUENCE DESCRIPTION SEQ ID N0:84: 
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(2) IiramnCN FCR SEQ ID N0:85: 

(i) SEQUENCE O'^WTIRISTICS: 
(A) LBGIH: 16 base pairs 
CB) TYPE: nucleic add 
CO STRANCEENESS : single 
(D) TCPGLCGY: linear 



Cxi) SECteCE GE5CRIPTICN: SEQ ID NO:85: 

C^CGlJTCCTu AGJITSC 

C2) INFORMATTCN FCR SEQ ID ftf):86: 

(i) SEQUENCE C-AiVC^ISHCS: 
CA) LENGTH: 42 base pairs 
(B) TYPE: nucleic acid 
CO STRANGENESS: single 
CD) TCPCLQGY: linear 



Cxi) SEQUENCE 0E5CRIPTIGN: SEQ ID N0:86: 

TATATATAJC TCG^ACCGC OAG^TbiTC CCCTTCCAGC CA 

(2) NFCRMATICN FCR SEQ ID N0:87: 

(i) SE-XJE^iCE C-ARACTERISTICS: 
(A) LEMTTH: 38 base pairs 
(3) TYPE: nucleic acid 
CO STRANCEDNES5: single 
CD) TCPCLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 
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TATATATATG CGGCCGCTCA AT7ATUTTTC TCGTTOJT 
(2) INFORMATION FCR SEQ ID NO:88: 

(i) SECUENCE CHWOERISTICS : 

CA) LENGTH: 42 base pairs 

CB) TYPE: nucleic acid 
(0 STRWEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEOJENCE DESCRIPTION: SEQ ID NO: 88: 
TATATABATC TCGCGGGCCT TTACCXTOC TATC^GiGAT AG 
C2) INFORMATION FCR SEO ID NO: 59: 

(i) SEQUENCE G-AR^TtRISTIG: 

(A) L£ V E7H: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STimEDNESS: single 
(D) TCPCLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

TACGCCU7CA ATAOGcTTCA CTAAAC££C TUTOC 

(2) INFORMATION FCR SEQ ID N0:90: 

Ci) SE-JUEMCE CHARACTERISTICS: 
(A) LE>JGTH: 35 base pairs 
(5) TYPE: nucleic acid 
CO STRWEENESS: single 
CD) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEO ID N0:90: 

TCGnSWCG TATTC^QX GtPGTPCXA CTATT 35 

(2) INFCfl-ATICN FCR SEQ ID .NO: 91: 

(1) SEQUENCE CHARCTtRISHC: 
(A) LENGTH; 2Z'base pairs 
CB) TYPE: nucleic acid 
(0 STWCEENESS: single 
(D) TCPCLQGY: linear 



Cxi) SEQUENCE DESCRIPTION; SEO ID NC:91: 
CGTTI^GC;7 AKCGttTCT PC 
(2) IMFCRj v ATIOJ FCR SEQ ID NG:92: 

(i) SEQUENCE C-AWU&iSriS: 

(A) LENGTH; 42 base pairs 

(B) TYPE: nucleic acid 
CO STRANGENESS: single 
(D) TCPCLCGY: lineer 



(xi) SEQUENCE CSCRIPTTOI: SEQ ID NO:92; 
ATATflGAGu CTTMTTMT CinUiGVG WCCHPC TC 
(2) INFCR-'ATICN FCR SEO ID NG:93: 

(i) SE'ieCZ CHARACTERISTIC; 

(A) LENGTH : 35 base pairs 

(B) TYPE: nucleic acia 
(0 STONCECNESS: single 
(D) TCFCLOGY: linear 
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Cxi) SEQUBJCE DESCRIPTION SEQ ID NO: 93: 
ATATA&GCT CA3GGGTTGA MAGTOGC G^CCG 
(2) INFCFMTICN FCR SEQ ID ^:94: 

(1) SEQUENCE OWGERIS7ICS: 

(A) LENGTH: 46 base pains 

(B) TYPE: nucleic acid 
(0 SiMCEENESS: single 
CD) TCPCLOGY: linear 



(xi) SEOJENCE DESCRIPTION: SEQ ID ^:94: 

TATATATTAA T7AAATAGAA TVCXZiX: TC^GACAATG CGATGC 

C2) INFCRMAJICN FCR SEQ ID j^JC : 95 : 

Ci) SEQUENCE OtWGBISTICS: 
CA) LENGTH: 40 base pairs 

(B) TYFE: nucleic acid 

(C) STRWEENESS: single 
CD) TDPCLCGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:S5: 

ATATACTCGA GTAGCAA7G3 TCAA^CC^GT AACGTTATAC 

(2) INFLATION FCR SEQ ID ND:96: 

Ci ) SEGUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 



WO 99/18226 



PCT/US98/21062 



36 

(B) TYPE: nucleic acid 

(C) SiRWEENESS: single 

(D) TmiOGY: linear 



Cxi) SEQUENCE DESCRIPTION: 5EQ ID NO: 96: 

gccottul Mtniiiib gcisoigct ttccosg omc 

(2) INFORMATION FCR SEO ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH; 39 base pairs 
(8) TYPE: nucleic acid 
(0 STRWCElNESS: sirale 
CD) TCFCLOGY: linear " 



(xi) 5EOJEMCE DESCRIPTION: SEO ID N0:97: 

wLWw rbK^W-ULij (jjuibjiuu TuJiUTCoi 39 

(2) IKFCPMATICN FCR SEO ID NO: 98: 

(i ) 5EOJGCE OAR^CTERISTICS: 

CA) LENGTH: 3S base pairs 

CB) TYPE: nucleic acid 

(C) STR*NCECNESS: single 

(D) TOPOLOGY: linear 



(xi) SECUENCE CESCRIPTICN: SEO ID N0:98: 
ATATA2GCT CT\/WG£A TGATTTGATT TOATCC 
(2) INFORMATICS FCR SEO ID N0:99: 
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CD SEQUENCE CPARCTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
(0 STW1CECNESS: single 
(D) TOPOLOGY: linear 



Cxi ) SEQUENCE DESCRIPTION: SEQ ID N0:99: 
CG0GCCGV\T TCTG3G03CT CACMTT0G3 
C2) INFCRMATICW FOR SEQ ID NO: 100: 

(i) SEae.CE CHARACTERISTICS: 

(A) LBJGTH; 7 ami ro acids 

(B) TYPE: amino acid 
CO S7RWENESS: single 
(D) TCPCtOGY: liner 



(xi) SEOBJCE DESCRIPTION; SEQ ID NO: 100: 

Pro Lys lys Lvs Lys Arg Lys 
1 5 

(2) INFCKFATICN FCR SEQ ID NO: 101: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8000 base pairs 
(BJ TfPE; nucleic acid 

(C) STRANGENESS: single 

(D) TCPCLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
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ATTGAGGXG TAGTACACAC TATTGAATCA MCm&C OWTCSXT ACCATCACAA 60 

TCSASVGCC ASTAGTAAAC GTASmAG ACCCCCiGJG TCCGmmC GTCCACTGA 120 

AAAAAAGCTT CCCGCAATTT GW2TAG7AS UOGCPGH' (XTCCAAAT G4XATCCTA 160 

A TCCCS&GC ATiTTtGCAT CTCGCCiGTA AACTAATCGA GOTGAGGTT CCTACCACAG 240 

(PCGATCTT G35CATAG3C ASO»C0G3 CTOGTAGAAT GTTTTCCGflG CACOSGTATC 300 

ATTCTGTUTG COCCATCCGT ASTCCAGAAG ACCCGGACCG CATW15VW TACGCCAG7A 360 

AACTCGCGGA AAAAGCGiGC A^GATTACAA ACAAGAAC7T GCAT12GAAG ATTAAGSA7C 420 

TCCGGACuTT ACTiBATACG CCGGATCCTB AAACiCCAIC GOTSuTT CACAAG3ATG 450 

TTACC7GCAA CA7GCGTGCC GAATAT7CCG TCAIGCAG3A CGTETATATC ACOGC7CCCG 540 

GAACTATCTA TCA7DGGC7 ATCAAAG3G TGC3SJCCT GT/OGKTT GGCTCGACA £00 

CC2CCOSGT CA7G7TC7CG GC7ATG3CAG GTTCGTACC T1XGTACAC ACCAiCTGGG 6c0 

CCGAGAGAA ASTCCTrGAA GuxGTAACA 7CGGAC7TTG CAGCACAAAG CTGASTGAAG 720 

G7AGGACAGG .AAAATTGTCG ATAAIGAGGA AGAASGAGTT GVGXC33S 7C3C53S777 7S0 

ATTTU7CCG7 A33AT&CA CTTTATCCAG AAC^GG: CJGTiEM AGC7G3CATC 84-0 

T7CCATCGGT GTTCCAGTG MTGSWGC AG7GGTACAC TTGCCGCTGT GATACAG7GG 9C0 

TCAGTTGCGA AGGCTACGTA G1GAAGAAAA TCACCATCAS TCCGS3GATC ACG33=GAAA 960 

CCGTGGGATA GC-CGGTTACA CACAATAGCG AG33CTTCTT GCTA7GCAAA GTTACTGACA 1020 

CiGTAAAiGG AGAAC5GGTA TCGTTCCCiC CATCCCGSZC ACCATATCCG 1080 

ATCAGATGA; TCGTATAATG GaWHSTA TATC^CCT&A CGATCCACAA AA4CT7C7GG 1140 

TiGGGCTCAA CC^GSAATT GTCATTAACG GTAGGACTAA 2GGAACACC AAIXGATCC 1200 

AAAA-ACCT TCTGCCGA7C ATAGC^CAAG GG7TCAGCAA AIGG3C7AAG GSGOEAffiG 1260 

ATGA7CT7GA TAACGAGAAA ATGC1FGGTA CTAGAGVG CVGCiT ACG TA7T33CTGC7 1320 
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TUTGGGCGT7 JUZMJWG AAAJTACATT OGTTtTATO CCCACCK3GA POxAKH 1380 

GCGTAAAAG7 (m&ZXT TTCGCGGTT TTCGCATGTC OTCGTAHS AGXCTUT 1440 

7HIGA7U7C GCT&G3OT mnmC TCGCAJ7T3CA ^CCWGVG G^MW 1500 

TCOGC^GGT OUa^GSV\ TTAGTD\TGG AGGCDW5: TQGTTI^G GATCCTUiGG 1560 

fiGMZCK /£CG£GVG CTCa^GVG QKTKTXL ATTAGIGGCA G-WAGGCA 1620 

CGC^GVGTT GTCIGCGViG TT£*££7 CC/XIIGGt ATUi^GC^G 16E0 

GAT7AGTT&A WCCCCGCG: GGfcmAA GGATAA iACC TQVGCVAT G£CGTA7GA 1740 

TCGGAC^GTA TATCGT7U7C TCGGCAWT CTUTACT^A GV\iGC£W\ CTCGG^G 18C0 

CGC^XGCT AOXATG^ G7TAAGATCA WGXXTC CG3V&TCA SMSmAG I860 

CGGTCGA^C ATACGACGCT AA^GTAC^A TGCCrGu^GG .^CCGTA CCATCGCC^G 1920 

AATTCCTAGC ,02^GT^G /£CGCG^CJ7 TAG7G7ACAA OSW&GrG CTTblG^CC 1980 

GCAWA7A CCiCA~:: ATbCATGGCC CuSCCWSA. 7ACAGA*G£ GAGG^CICA 2040 

A3G77ACA4A 3GC^GC7T GC^GAA^G AG7ACG7G77 Tb^GTG^ AA*VCGGi7 21C0 

GCG77AAG-A GSVGWCC TC^G7C7b3 TCGTOU^ ^O^CC AACCCXCC7 2160 

ATCATG£C7 AGuCmjSS GG^O^ CCCGACGGC GuTCCOG^C *AGG7GGA^A 2220 

CMTAGGrGT G\TAGGC^A CCG3GGTCG3 GCAAGTCrGC TATTATCA^ TCACTUTCA 2280 

CGGC^GA TCTIGiTAlC AGCGGW& A^GAAAATTG TG3C&AATT G^GCCGCG 2340 

TGC7AAGAC7 GAGGGG7A7G G^TOCGT (&fiGXXZ AGATTCGGT7 A7GCTCAACG 2400 

GA7GCC-CAA A£C£TAGAA GTGCU7ACG ITGACGA^GC GT7CGCGTGC CAGC^G 2460 

CCTACrnGZ G7GAT7GC7 ATCGTC^GGC CCGGCWU. GG7AGTAC7A iGCGSX^CC 2520 

CCA7GCAATG CGGAT7C77G AACA.7GATGC AAC7AAAGJ7 AGATTTCW CACCCTGAAA 2580 

CXCA^CA T7CTACW ATATC7GCG GCGTJGCACA C/GCC^GTTA 2640 

CAGC7A77G7 ATCGAC^Gb GAT7ACGA7G GWGfiG=A AAGXGVC CCG7GCAAGA 2700 
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PGWATKA MTCGATATT MXZZXA CWGCCGM GGM3SAT A7DVIUX* 2760 

CATUTTTCCG CGQaiuGGiT /VGCAATTGC AWC£0 mSXAT GWTMTO 2820 

C^GCCGCGjC CTACC^A A^G^GUTA TK&TCCGS C/WWGTEA 2S80 

A7&VVV£CC /iCIUrACGCG ATC^GATC/C AGCATGT1&A GGTGT7GCTC AXCuXI^ 2940 

>^W<7 ;OI!G^M #uIG3GG GCMCCA7G GATTA«3M CTOCTVO 3000 

TACCTAWG AMCTJVG (UPGATPG ^OQ ATI&vaC AfiffiSWM 3060 

TTGCIbCW MXXZXC CTCCCGJTB CCAATCCSTT OGJl<A£G ACCWGTTT 3120 

GCTGGGSA .^GCATTGGAA CCGATA3G CC*2E22G TATC3TAC7T ACGG^TlC 31S0 

ACH^CCCA OaGTHuSS ATGiCWC ACATPCGGCC ATiTACGC'CT 324) 

TrGACGTAAT T7£CATTAA] TTTTTCGSCA TGGACTG^ M£3£C75 T7T7QWC 33CC 

AGSCA^CC .^WCGTAC ATX^G GCCG^GCT CATTGS^CA 3260 

ACAGCCC^GG A^CCGCW TA7G3G7ACG TKCGCG£V\ CTQCCCGTA 3*20 

GATTTCCGG7 GT7C3GCTA GCTGGGA^Gj GMXVCi TT^.TTTGC^ AC3GG^A 34cG 

GX^GTTAT uUTGCAC^G CATA^CCTGG TCCC3CI7&AA CCCCAATC77 CCTtrCGCCT 35^0 

TrGTCCCCGA GTCCAK3S5G ATCCAACCGG GdCGSICa AWTTCHb AACCXTTCA 36C0 

A^OXTC AGTACTTGiG GTATCrG^G AAAAAATT&\ AjCTCCCCGT /VJPGAATCG 3€c0 

mzxiaz cccgattcgc atagccgot gg™&a cvcmick GcrmDQGsr 3720 

TTCCGCCGCA G3C-CG3TAC G^XiliiGT TCATCVCAT ]£^ACTAAA 7?-CG'/YCC 37S0 

KZXJTTCA GCOIGAA G^CCA 1GCGG CGACCjTAAA A^COTi CG ZZnCZCX 3840 

TCAATTCCCT TAAJC^GGA GCOCIuGG TCG7&VGTC CTATGGCTAC GCCGAC0GCA 3900 

JOGRSaGGA O^GTCXC GCTUTTGCCA GAA^JTTTbT C^TUTCT SXCG^ 3960 

CAGATTunrr cr^gcaat acwatut acogatut cc&gacta gxaaggz 4020 
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GTACACG3CA ATKXDXG (XOTCTGA ATTGCGTGAT TTCGIGCGIG TA1&WA 4080 

OU&GAKZ XmflGZ GCGCGGTOT AOKXCAA /WffiSQW AWJ&U 4140 

GTDWSSA MJXXCCA ATCQjCTGGG TAGJCCiGGC &VGS=G7CT 4200 

GCOSIGCCAT CTATAAACGT miZUXA CTTTTACCGA TTG2GCD5CG 4250 

CGC/VGAAT GCIUIGTCC CTA3SAWA A4G7GATCCA CGCGGTCGGC CCIEATTTCC 4320 

(SVGXCC XMVGTA GC-CTiGWAT TCCTACAAA4 CGCCT/OT GC=GIGG0>G 4580 

AOTASTAAA TGWCflWC ATDVGOC TCSTATTCC AOGCTATUi AC^GGGATTT 4440 

ACSXG3G AAWCOS: CTTGA^G7AT (XTTAAGG Cn&CAACC GESTAE 4500 

GAACTGACGC GMSTAACC ATCTATiGCC TGGATA^GAA GnGGJAGGAA ^GAATGAG 4560 

GGGuXXa AOTAAGGAG TUiGWM AGC7C/VGGA TWGAIATG SSATGiG 4620 

A71PGTTAI7 ATGGA7CCAT CO^CAGTT GC7TGA4333 A^GWGGGA T70C7AC7A 4680 

CWAS^AA ATTETATTCE TACTTCGW GXCAAATi CMCVCK GCWAGiCA 4740 

HSGG^T AAAGGTCCTB TTlCCTAATC ACC^AAG TAAI&VCAA raJTGTGCCT 4SC0 

AGATA7TGG TGAffCCATG GWCAATCC SSWGIG COGSTC^ CA7AACCG3T 4£60 

C3TTTAGCCC GCCCAAAiCG TTGCCGTGCC TTTGCATGTA TGCCAT&iG (DGAA4SG 4920 

TCCCAG^CT TASVGCAAT AACGTOAAG /vsGTOrar ATGCTCCTCC ACCCCCCTTC 4980 

CTAASCACAA AATT,VGAAT (TTTCASAAG TTMUGX GWAG7C CTGTTTAATC 5040 

GxACJCTC: CGCA7TCG7T CCCGCCOGTA AGTACATAGA AGTGCQSGAA (X-CC7ACG 5100 

CTCCTCCitC fiVCSZ&G ©GSCCCCE 1 A^GTfiSTAGC &OCCGTC4 CCATCTACAG 5160 

OGATAS CiOaCn&T CTCC^a TCTCCiGA TATGSAT&iC 5220 

GCTC-OTTT TTCSGCTTT AGCGGATCGG ACVCTCTA7 7AC7AG7A7G &SCAG77GG7 5280 

csxxskz tagttc^ta &^tagt/:g agg&wsu ggtgstgtg GciG^Grrc 5340 

A7GCCG7C; .^GCCTC-CC CCTATTCC^ CG2CVGGCT AA/WGA7G GCCCGCC7G 54C0 
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MCGGWG AAWXCCC ACTtMICGG CTCTGflGTH CKDCJCT ^60 

GTnHJTSS GGTATCCATC TCCOm^T CAA7T77C& G£=G^CG gCCQGCDVjG 5520 
CaGCGCTACA .CCCCIGGCA AOGGCCCCA CG^TUim: TA7UTUTTTU QKTCGTTTT 5580 
CCG*CG£^ GATOA7M O&iGCCGCA GWACTGA GIOIMIC GTCCiG™ 5640 
GATD\TTTCA /£G£xSV\ GuWCTOA TTATATUJiU CC&\T(XCC GTATUmTC 5700 
CTUfACGCAA (&G&X£i AG^CGC^GGA GCAGMK TGWACTGA (TA^CCGGGG 5760 
TOGTCGJTA CATATTTTCG ACaS^AOG GCCCIGGGG\ CTTGCAWG A/£TU£Trc 5820 

TGC^CA GCTTACAGAA CG^CCTTu] AGXAATG7 ATOTCCCC 5880 

CGGibCT GSi. OTGiM G^GGWW TCWCTC^G OT:*G\7G ATGCCCXCG 5940 

/WxDViCAA A/OTJGSTAC C^TCTCG7A AATOGWA TCAGWGlC ATA^CTC 60G0 

mXItCT GiCZ&CTA C&CIGTATA tOCmx WIOGZA &WGC7A7A 6060 

fi&TVOJA TCC&VACCA TTUTACi CCA GGCGWT^ TCCGATCC^C 6120 

AGHIXIUr AGjGTUUT AACACTATC TCCATGSlA CTATCCSCA GWGCATCTT 6180 

ATC^GATTAC ^CMTAC GAlbCTTACT TEGATATG^7 ,^Cu]^CA GrCuTTGCC 6240 

TCGAT/OSC AATTTUTbC CCCGCTA^ TTAGWTTA OC&VVVW\ CATG^GTATA 6300 

MCm&tt TATCCGG^T GCKT7CCAT (^GCGATCCA GWXGCTA C^MIGTCC 6360 ■ 

TOTTGCCGC ACTAAAAGA MTTGCAAGj TCXGC^T GCSIGAO; CEMXTG3 6420 

ACTC^C ATTWJUTC GAAiGCmC GAAAATATbC AlUTAATGiC &£TATiG3G 6480 

AGGPG7TCGC TCGGA^GCCA ATTAjGATTA CCXT&GiT MXCGCA TA1GTAGCTA 6540 

GAC7GWG3 CCCTAAGGCC GCCGCXTAT TRBCWG^C GTATAATTTS GTCCCATTGC 6600 

MSVGTKC TATC&m TTCGTCA TGG ACA7GAWG 5TOXCAG 6660 

GGOAACA tfOGWU TAOAGTCiT .^WGCCGCA GWICTGG 6720 
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csogctta cttatccgs attohi mtogtqgg tightm gccgtcttcc 6780 

TTCCAAACAT H2ACACGCTT TTTCACATCT OGGCSMGA TJTH2ATCCA ATCATAGCAG 6840 
A^CTCAA GCWG30GAC CQ32TAOGG «MS4TAT CGCATCATR: S«CAAAA3:C 6900 
MX&CZ TATGGC5TTA ACQS™ T5ATCTIGGA GffCCTGST GTGOTWC 6960 
WXACTCGA CnUTCM TCGSrmG G=6AAATATC ATCCACCCAT CTACCTACQj 7020 

GTACTC377T TAAATTCGGG GCGAIGAIEA AATCCG3AAT GTKCKXA CTTTTTGTCA 7060 

^GTTTT GAATCTUUT ATCGCC4EA G9GWCWK AJ«3ECTT AAAAC5TCCA 7140 

GWEraMT GnCATTGS: GX&CAKA TCATACATCG AST^CTAilT SCAWGAAA 7200 

TCG^&G GIG0GCOS0C TCGCTCVCA GATCATCG4: G3GIWTCG 7260 

GISCWMC ACCTrACTTC TGCGGCGSAi TTA7C7TGCA /SGATTC33TT PCITCCC^ 7320 

G^-CCGCGT GGCGGATCCC CT5A4A433C USTTTA^T SSTAVCCG CTCCCSGXS 7380 

fi&GXCA t&C&GC ASVG*££ CiCTGCTAGA TGWOWG G05TCETTTA 7440 

g*wgstat a^ggcct ttag:^ convex ccgotg (JT^TA 7500 

TT«XCIGT CCTAGGGCA Ti^GWCT TTGCG3GG CAAV&GCA TTCMffiCtt 7560 
AATAASGCAT CTCTAOUTG GTCWMTA GTOGWTO TTCATTTCAT 7620 

(^CTAAIA CTACA4XC ACCCCATCA ATASAGSAn CTTTAACA7G CTCQSOCKC 7680 

GCCGCTTCCC GGDCCOMCT GCCATGTGSA GGCCGCG3AG A43SG3M GCGGCCC03A 7740 

TSCTE0CG CWGSxIG GCTTCTOM TCCASGVCT QACC^: CTDAGTECCC 7800 

TA^TTCGJtfG^AS^ 78g0 

ASAASZAGG: GCCCVGCAA CCXCGAASC CGVGWCC AAAAACGCAG (2GVGAAGA 7920 

AGA4GCAACC TCOAAAACCC AA^CCCGGAA AGA&CAGC3 CATGSCACTT AACTTiGGAGS 7980 
CCGaGASATT GTTO&iCGTC 

8000 

(2) INFDPMATICN FCR SEQ 10 N0:102: 
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(i) SEOBJCE CHWCItRISTIG- 

(A) IBGTH: 8000 base pairs 

(B) TYPE: nrleic acid 

(C) SmmmiESS: single 

(D) must: linear 



Cxi) SEOBJCE DESCRIPTION: SEQ ID NO: 102: 

attocss rnxxx mmiCA a^c gktigmt kmxm eo 

A^WC GMOT-G AOHCW TCuJTTTUTC GTGCAAGGC 120 
ATGCC^C ATTTTTGCAT C^OGTA A^ATI^ GCTGG^GGTT CCTACCW3 240 

™^CCCCA^^^ 36Q 
*CT£Sa MWuTO AAGA77ACAA ACA<G^ GCATI^GVG 
7GS3XSST ACT^TACG CT-G AW^TC GlOSuTT OCWHTB 
TTACCTGCAA GATI3GGTGCC GAATATTCCG TCATGC^G2A CGTCTATATC AACGCjIXCG M 
GMCTATCTA TCATCAGGCT ATI^WGGCG 7GCG2ACCC7 GTAC7GSA7T GGCTTG^CA 600 
CCTOT CATGTTUOG OT ^G GTRSWa TGfflWC ACOO^ 

ATOTTW GCGCGTACA TCG^TuG «3MWG 
GMSraS AAAATTGTCG ATAATG^GGA AGVG^ GVGCCCGGG TCGCG33TTT 
ATTTUTCCGT AG^CA CTTTATCVG AAC^: C TO ^ AGOT 

TTOorasr cirrccxnG mubwqc ahistoc toeict gatac^g 

TTOGA A^acSTA GTWG^ TC^CATW TCCCGG^TC ACGGGWA 



420 
4S0 



660 

720 

760 

840 

900 

960 



WO 99/18226 



PCT/US98/21062 



45 

COTG33ATA CGCQOTCA CXMJPdB #2XTTOT GOA7GCW GtTfOGXA 1020 

C^GTAWGG ^WuaJTA TGJTTCCOT TUIGOWA CATDHSCC ACCATATGCG 1080 

ATOGATG3C THITATAATG GCO££\TA TATOCOG\ O^TCCACM A/VCT033 1140 

TTG33CTCA4 CC^GCGAOT GKATTMC& GT/CTM OffiVOCC /WCCATOC 1200 

AAMTTAU 7CIGCGGATC ATAEACMG (SHO^CM ATOGTTA^G G^GCGOVGG 1260 

ATGATUTTCA TAACGWA ATCGG3CTA CIPG€MZ WGJTML TATGGCTGC7 1320 

7CIG3GCGTT JUlXTttG AAAGTACA7T CGTmTATUj CCCXCTG3A ra^CCT 1380 

GCSTAAAtfT aiC^TTCT TTTPGJUT TTCCCATG7C OTO^ATGG /C&SCGU7 1440 

7KCCATG7C QO^GGC^G AAATTkWC TGGCATTIjCA ^CCAA^G 6=<£VVWC 1500 

TCCTCOGG7 GOE^A TTAG7CATGG TGCTTTTGX] GATGCT C^GG 1560 

M&fiCZX *GS£^G GGSttVG (X77UXC ATOGIESW SCWSGSW 1620 

TUSffiGKC CGC^GTT GTCiGCGWSG TIESGGST CCAGG0GS3C ATCQG=GCAG 16S0 

CA7TAC77GA PACCCGZX GGiTCACGTM GGATMTACC TCWC¥AT (XCGTA7GA 1740 

TCGSAC^TTA TATCGnuTC 7G2CCAWT (X7GC7&A GV\TGCCAM. C7CGC^C-£ 1800 

(BOOST ;£OG47C*G GTTA/^GATCA TAAC^CTC CGWGATCA GCAA3GTACG I860 

CGGTCGWC ATOQ3CST AA^GTAOGA TCCC^GG ^GGIGCCCTA COTGGCC^G 1920 

MTTCroGC ACIG^GK^G AGCGCCXbf TAG7UTACM TTOWCC 1980 

GCAACTATA aXATTGCC ATCHIGGCC CCGCWOA TAC^^G G^GTAa 2040 

/ ! GGTTACAAA GGtfG=GuT GC^AAC^G AJTACGIUiT TG^GX A^GVGOJTT 2100 

QCuTW&A a^AGCC TC^CIGG TCCTCT CG33 .^AACitXC AACC :TCCCT 2160 

ATCATGAGCT AGCTCTGM GSOM^ CCCE^COGC GGTCCCGTAC AAGGTCGAAA 2220 

CAATAGG^JT GATOGMA CCGGGGTCGG CWGTOGC TATTATCW TCAACTtJTCA 2280 

CQXCGAS; TCTTGTTACC /^GGGWGA AAGAAAATTC TCGCGAMTT QQQ3CCGACG 2340 
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T&JMGXJ G^GGjGTATC CXATTKnr (Z^t&CM A^TTCuGuT ATCCTCME 2400 

GATGCCXA4 PG3JII£&A GIBGGTOG T7OT>W£ OTHHGC CXQMSG 2460 

cxrAcrra: cttgattgct atojttogc cccgcvov\ qet^/oa tuq^gxi 2520 

ccatgcwg CQSMirm: a^caigatgc mjmn pcattkaat caccctwa 25eo 

CXCVG^ TTCTACA^GT ATATUCCCG GlGTTGOO OXTCOTTA 2640 

WGuATTGT ATC&CCIG CATTACGATC &WGATGAA A^CGiCGW CCGH»a6A 2700 

p&tCAT£A MTC&TATT AC0GG3KIA CWGCCGAA (I0£3GGAT ATCATUXA 2760 

CATGTTTCCG CGGGTGGGTT AA^AJTGC AAATCGrCA TCCC3GACAT GVGTMTCA 2820 

C^CCGCGT CT(X¥G33 CTAACC^GAA AflGMTUTA 7HCG70CG3 OVVVV^A 2880 

ATSWCCC AGbTAOaGG ATDOTOG AGCAIGTbSA O^GTiGCIC ACCCGCXIG 2940 

A£AC^GGC7 #JTG7GGW\ ACCI7GC2GG GC&CCCATG GAJTMjC^G CCCACTA*CA 3CG0 

TXCTAW3 WCiTTCPG GCTACTAIS AG^CTGGGA ^GuWGC AfiGGGTOA 3C60 

TTEGTSCW AWGCCCC OICCG7G CC4ATCCG7T CCC-GCVG #CA#£117 3120 

GCIGGGCG^A AGCATTCSSA CO^TAC7>£ CMGGCCGG TAJCGTAC7T ACGGEiTGCC 3180 

/SGTGS^GuGA ^OGTTCCCA CAITTTECGG AIGWC /CA77CGGCC ATTCCGCCT 3240 

TA3rCGTMT TtGCATWG TTTTTCGGCA TGG-^CTTGAC /VGCGG-ACTG TTTOVVC 3300 

/i&JGCATCCC ACTAKETAC CATCCCGCCG AT7GAGC&G GCCG^GCT 0\TO3^CA 3360 

AC2GCCC2GS .AraCGuVG TATCGGTACG ATOCGCCAT TGCCGCCGAA CTCTCCCGTA 3420 

GATTTCCGGT GTTCCAGCTA GCIG3SVGG GCAC^COT WTTGG^G 3480 

CCPGAG77A7 CTCIGC^G CATAACC7GG TCCCGGTi^A OE^AIOT CCKX5CC7 3540 

T/GTCCCCGA GTAC^GGmG A^GCWCCG GCCC3S7CAA AAWTCTO AACCXHTTCA 3600 

AACACQ1C7C AGTACTTCIG GTATC^GG AWATibA ^C7CCCCGT A^G-^GAATCG 3660 
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MTGGATCGC CCCGATTGGC ATflGODQGTIG CAGATAAGV\ CT/CWCTC QCTTTCGGjT 3720 

TTCCGCCG^ G2XEGTAC G^CTGuIGT 7CATCAACAT T&MCTAAA TtO&AfiCC 3780 

AaXTTiDA GC*Gi££AA G^CATCCEG (JKHTm ACCCTTTCG 0GTTCG3CCC 3840 

1WT7GCCT TMXC^A QSXCCTCG TISTGWTi; CTATCGTOC GCCG^CGCA 3900 

ACWG^A COTGTIXC GGTCTTGCCA GWGTW 0£GGTUTCT GCPGCG^G^C 3960 

C^TTCTUT CTWGCAAT ^GWTCT mGATTTT (TO^CATO G/SGWGCC 4020 

G7ACC3GCA ATTCA2CCCG C^CATUGA ATIGCSTGAT TTOTCCGTG TATG^TA 4080 

CWGATG3 .CTni^GCC GCGCO^CAT ACCGCTOA WGZXAAT ATTCCTGiC: 4140 

GTi^GGA AGC^JTibTC ACCGC=GlCA ATCCGGS3G WKCSGS: GVG^GT CT 4200 

GCCTTbCOi.7 CTATAAACGT 7GGCCGACCA (JTrTTCCGA TTCXxC^G G^CAGGCA 4260 

CCGC^GMT GOJTETCC CTAGGW& A^.TGCA 'ZGCGGTTCGGC CC7GATTTCC 4320 

(EUGCXC: fWGCj&A GCCTTGAW TKTACAASA CGCOTCCAT S^GIGGM 4380 

ACTAGTAAA 7WCATAAC ATCW7CTG TGx^TTCC AGGCiATCi ACaGGCATTT 4440 

ACGC^GCCGG AWG2CCGC OTWG7AT CCfTAOT CTTG^CAACC GCGC7AGCA 4500 

GACTMGC GGACGTAACC A7CTATTGCC TGGATAAG^A CTGSV^ AGV\TCG£G 4560 

CG3CCTCGA AOTA^G TCTGTAACAG AjCTCAaGGA TCA*GATATG GAGATCC^CG 4620 

ATG^TAGT A7GGATCCAT CV&VGH GCTIGWGG AAGWGGGA TTCAGTACTA 4680 

WAAGGVAA ATIUTATTCG TfiCUC&fiG GtXCAAATT CGATCAAGCA {^AAA^A 4740 

TGXG^GA.T AWGTUC7G TTCCCTMTG ^CAGSWG TAATG^ACM CTTTfUTGCCT 4800 

ACATATTGGG TC^CCATG GVGC*A7CC GCGAAAAGTo COSGTOGfiC CATAACCCGT 4860 

OGTCTjfiGCCC GCCCWVSCG TTGCCGTCCC TTiEMGTA TGCCATCACG CG£AAAGG2 4920 

TCC^GCT TAGVGCAAT AACJTOWG AfiGiTACAGT A1H7CCTCC ACCCCCCTTC 4980 

CTAAGCCAA AA7TAAGAAT GTTCWGG TTCAGTOX GWGTACTC OTrnMiC 5040 
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OGC^CTCC CCCGCCCGTA ACTACATAGA AGTGCCAG^A CAGCCTACCG 5100 

CKHCCVX PQ&n&G OG3CCCCCG AAGTTUTAGC GflOSCCGTCA CCATCT/OG 5160 

ctotmx ciam^T gtoc^a tucacig^ tatcgatg* a^gcgvg 5220 

GCKXJTTT TmGJTT ASjQGATCBS ACWTUAT TACTAGfAiG GWTCGr 5280 

CGTOG^CC TAGTTCXTA GAGATAGTAG AaLW£A GGTGGIGOT O^CGre 5340 

ATGCCGTCCA AWECIGCC CCTATTOX CGCCWTT AAAGAAGATG GCCOGDOGS 54C0 

(^GCGGCVG AW&GCCC ACTCKCGG CIl^GTCC CTCCXUU 5460 

aTTTbGTGG GGTATCCATG TCCCTU2GAT CMTTnI^; CGE^GAGACG GCCCGCCAGS 5=20 

CrGCGGTTrCA ACCCCTm AC^GCCCCA CGSA7GIGCC TATGiCmTC GGATCbTTTT 5550 

CG^CGS^ GATTGATGAG GGSGEGtt GATTArCTGA. CTCTWCC G7CC7GT7TC 564} 

GATCATT7GA A03GGCGW GTG^CTCAA TTATATCGX CC&TCAGCC GTAiUTTTTC 57C0 

CCTACGCAA GC^CGT AGACGC^A GM3=G£C Tl^TAGbA CTAACCGSG 5760 

TAGJ7GG37A CATATTTTCG .CuX^ GCCCTGGGCA •CTTbCW^ AAG7CCGTTC 5c20 

TCtfGWCA GCT7AG3BA CCG£CTu£ AGCGCAATGT CCIGSWGA AnCATGCCC 5££0 

CGGTbCTCGA CACGTCGOAA G^GSWA: TCAWTC-S GTACC^GATG ATGCCfXQG 5940 

AAGCCAACAA M^flGGWC OOTTCGTA TO^GAWCC ATAACuO] 6000 

AGOGAOTCT GfCAGG^CTA CGCTJTATA ACTOXC^C AGATC^GCCA GAATGCTATA 6C60 

AGAICXCTA TCCGAAACCA ITGTACTCCA OTASCGTCC GGCGWTC TCCGA7UX 6120 

^rrcGcrcT agogtctgt aacaactatc tgcah^a ctatcgxa ctagcatctt eiao 

ATCAGAT7AC GATGCTTACT TCGATA1G3T AG^CG^CA GTCGCCTGCC 6240 

TGGATAGGC ACCITCTGC CCCGCTAAGC TTAGAAGT7A CCCGWAAA CATG^GTATA 6300 

G=GCCCCG^A TATCCGCAGT GCGGTTCCAT CAGCGATCCA GVOCKTA CWATCTCC 6360 
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TOTOCCG: /VCTAWGA MTHME TCACGO=GAT GCGIWCTG CCA/OCTS 6420 

JUOGZAC ATTWJIjiC GAATOCTTTC GWATATCC AIETMI&3C G^GTATTGuj 6460 

^GTTCGC im^GCCA ATTO3ATTA CGOG^GTT TGtVOJZA TA1GTAGCTA ©40 

G^CTGWGG CCOM3XC GCOLCTAT ITGCAA^ GFATMTTTG GTCCCAIXC 6600 

/V&WTGCC TAIGGATAGA TTGJTCA7GG /OTGVWG AXSIGW GTODJCCAG 6660 

GCCGWCA CACrCMM P&CZfififG TAGWIGAT /VWXXZCA GWCCCTG3 6720 

a^CTGCTTA uTATGCGGG ATTLACGGG3 AATIAGTGCG TA33CTDiCG GCCGTCTIGC 6780 

ttccwcat tcc^cglTt Tn^aiur assess ttttgatgca atcaiagc^g 6840 

AOCTTCAA GC-^GGCGC CCGST/033 A^CGGATAT CGCATCATTC G^WAGCC 6900 

A^G^CGC TA7GGCG77A ^CGGSTCTGA T^ATCT713SA 3SXIJGGG7 GTGGATGV£ 6560 

OOTCTCGA CTibATu^G TGCGCC7T7G G^G=AATATC ATCIXCCAT CTATTACG2 7020 

GT^crnrrrr taaattcggg gc&tgaiga aatccggaat gttcctcaca. (Thitgtca 7oso 

fC^^TTT gaathtcgiT atggcc^ca g^xtaga ^c<nrnr aaaacgtcca 71^10 

GAjbHal^G: GTTCAT^GC (^ZXAACA TCATACA7GG .^AGTATTT GACAAAGAAA 7200 

TGXTG^G GIGCGuXC TGGCTWCA TGG^TM GATCATC&C GCrGTCATCG 7260 

GT&G=Gtf C ^QTACTTC 7Gj3GCGG\T TTATCT7GCA ^TCGGTT #XrCCAC£ 7320 

CGTGCCGCGT QGCGGATCCC CIGWVGGC TGnTA^TT GQGTMCCG CTCCMCCG 7380 

ACS£G£CA AG^CGA^ A^ffiCG OUGCTAGA TGWCWG GCL7GGTTTA 7440 

G^TAGGJAJ AAC^C7 TTACOGTGG CCGiG^C CCGGTATG^G GTAGACAAIA 7500 

rac^c-mr cctaoggca ttgwot ttgccc^g cw^gca ttccaatca 7550 

TM^GjGGA AA.TAVGCAT CTCT/iaiJiG GTCCTAAATA GTG1GCA7AG TACATTTCAT 7620 

CTGOWTA OTCWACC /^vXCATGA ATAG^GGATT (TfTA^CATG CTCGGCCGCC 7630 

GCCCCTTCCC GGCCCCCCT GCCATGTGGA GGCCGCGM m^GL^G GCGGCCCCGA 7740 
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TtmirccG c^/ma^ 7800 

TPGKATT&j /OGGCACT /^CTCAAC GCCOTCTX AG2CCCGCCA CHaCGCD^ 7860 

/wgc^gg: gcccwcm (memo: avowee mmm gwgwa 7920 

PGMGMCC TGZAAAACCC AACCQIM AjXj^CAjOG CATCQCACTT 7980 

ccgac^tt Gnisiajn: 8Q00 

(2) INFCRMATIGN FCR SEQ ID NO: 103: 

Ci) SEQUENCE (>/MTtRIS7IG: 
(A) LENGTH: 11740 base oairs 
CB) TYPE: nucleic acid ' 
(C) STRWCEDNESS: sinale 
CD) TTPCL0GY: linear " 



(xi) SEQUENCE DECRimCN: SE0 ID NO: 103: 

ATTCACGGCG TPGTPOiQC TATTGAATCA WXZZX. (MiTGXT K£AJCXAA. 60 

TC33GVSGCC AG7AG7AAAC OTOO^SG .^CCCC^G TCCGTTT^C GreC^C 120 

AAWAGC7T CCCGCWTT G^AGTO C^GC^T CACTCWAAT (^CCATQTA 180 

ATCCC^GC ATTTTCGCAT CTGGCC^GTA AfiCTMim GCTm-ffiTT CCTACC/OG 240 

0*0*101 G&^TAGGC /CCGCXGGG CTCGTAGAAT OTnCCM (XZWAK 300 

ATTUTSTUU CCCCATCCGT ^CC*&VS ACCCSACCG OTO\Tt*M TjAffiCCAGTA 360 

AOSCG3A AWGCGTGC A/^TTACAA ACVGWTT GCA7GWG ATTAAGGATC 420 

T CCGXCGT AC17GATAG CCGSATCCR] AAACCCATC GCTCTGCTTT C^CAACGATC 480 

TTACCTGCAA CATGCGTCCC GAATATTCCG TOT13MB\ CdlGTATATC AACGCTUICG 540 

GAACTATUTA TCATC2GSTJ A7GWGGCG TT3CGGACCCT GTACTGcmTT GGCTTOGaa 600 
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(XKa^GTT CAIGIIULG GCTA7GGOT GUUHPCX TGCG7/WC ACOTCTGGG 660 

AGTCITTGAA QCQDGWCA TU&CTT7G CXZXmG CTWBAAG 720 

GTA03OGG AAMTTGTO ATATOGGA MiM &VGCCCGGS TUjjGjGTTT 780 

ATTTUTCCGT PGXK&CA CTTTATO^G MX^GC GXTTTS^G /iGCTQGCATC 840 

ttccatcgjT GircccnG mtoswgc mhujipqc ttgccgggt gatac^gtgg 900 

I&GTTGCGA AGGCTACGTA GIWGViM 1IACCATCAG TCCDQQWTC ^033^GW\ 960 

CDGTOGGATA (nH^O\ C^GVVTAGCG AG33G7CTT GCTA7GCAM GTXO^CA 1020 

C^GTAAAAGG AGUGSoTA TCCTIUTIG TGTGCaGJTA CATCCCGGCC ACGATATGCG 1060 

ati^gatgrc tggtatmtc gcccgsaja tatc^cctea c&7gcacaa aaagtgb3 1140 

t7g3xtcja ccagcgw7 gtclattaacg gik&ctta gksvcicc aacacc^tcc 1200 

aaaattag" tosccgatc atagxvg gtogc^a a1gggctaag tfgcgcaags 1260 

ateatctiga taacg^aa atcctgsgta oag^ag caagcttacg tatcsctgt 1320 

7utg33cg7t tcgcactaag aaagtacatt (h77ta7cg coxcigga -cgc^cct 1380 

GCGTAAAACT CCG^CTUT TTTAGCGC7T T7CCCA~C G7CCGTA7GG ACGf^CCTUTT 1440 

TCCCCATUTC GCTl^GGC^G AMTTGAAAC TCGCATIulA KWMWG G^GGWAAC 1E00 

TGCTGCAGoT CKJ£X£M TTAGTCA7G3 PGLCVGZ TGCTTTTCAG (&TCCTOGG 1560 

PG&COVG PGJ&GWG CTC0GWG (XTTCCXC ATTAGIGGCA (XAAAGGCA 1620 

TCGAGGCAGC CGCAGVGTT CTUGCGVG TGG^GGGGCT Qj>GS&C ATCGG^GCAG 1660 

CATTAGuGA AACCCCGCGC GJICAGGTM GGATAATAC: TCVGCAAAT G^CCHTATGA 1740 

TCGGAC^GTA TATCGmGTC TCGCCWT (HbTujGAA GAA7GCCAAA (TCGCAO^G 1£C0 

CSXCCGCT AGCAGA7CAG GTTA^GATCA TAACACAC7C CGGAAGATCA GGAAGGTAG I860 

CGGTOWC ATACG^CGCT AAAGfACTGA TGCCAGC^G AGGTOCGGTA OATCGCCAG 1920 

AATTCCTAGC ACiG^GH^G AGCGCGO" WiTGTACAA CGAAAG^G TTTG7GAACC I960 
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GCAWTATA O^OVmrC ATCCATOXC CCGCGWA TfOm^G VGVGIPCA 2040 

P&TTKMA GX^GCTT GC&WCG AGTACGnUTT TIXGTIi^ Afi&VSGGTr 2100 

GCGTTA^ G^GV(E TCCTTfUEG AM7G£C ACCCKUT 2160 

ATOT^GCT AGCTCTG2G QSO^\ CXOG^CCnX GSTIICOTC MGZKmA 2220 

CAAJPGXJ QATPGLXA CCG33G7CGG GCAATTCtfl TATTATWG 1WCTGTCA 2280 

CGGC^G\ ILiibii^C tGJZAAXA A^GWATTG TCGC&W\TT G^GXOXG 2340 

-nrrAAxr gpggj^atc c^ttacgx a^oir ^ttcgstt atittwcg 2400 

GATGCG^CAA AICGTAGAA GTGCIETACG TTCACWGC GITCGCGHX OCGC^GM 2460 

COTOTGC CTTCAlinT ATGJTC^GGC CCCCOVGAA GGTAGnACTA TC^SG^CC 2520 

CCA7GC*ATG CGGA77CT7C MCATGATCC A^OWGuT XATTTCiAT (XCCIGW 25S0 

A^GACATATG CXLA££A TTCTACVG7 ATATC7CCGG GCGTIGICA C^CC^GITA 2640 

C^GCTATTbT ATCGKXTG CATTACGATb &WGA11GAA AtfCACGW CCGJGCWA 2700 

AGAACATTGA AATCGATATT AC^GGGGCCA OWGI&b GCC2G33&T ATCATCCHGA 2760 

CATG7T7CCG CGGaTGGuTT A^GCAATTGC AAATCSJCTA TCCCG^CAT GA^GTMTGA 2S20 

CAGCCGCGGC CTC^VGGG CTAACC^GAA AAGGAGIUTA TKCTEGG OWWG7CA 2880 

A7GWACCC ACTGTPCGCG ATCCATCAG A3CATCJGAA COTiKTC ACCCGCACTG 2940 

K&OGTT AGTUIGSW\ ACCT7GCAGG GC&CCCATG GOTMaCAG CCCCTAfiCA 3000 

TACCT/WGG AWiTOG GCTACTATAG PGXOSZA ACCIWCAC AABGGAATAA 3060 

TRjCTGC^AT AWGCCCC ACTCCCCGKi CCMTCCuiT GGCTCCAAG ACOVCTTT 3120 

GGCGGC^A ^GCATIISAA CC&ATACiAG CCX2GCGGS TATCGTTAGT ACCGGTTCCC 3180 

AGTG^GC&A ACTUTTCCCA CAGTTTGCGG ATGOAACC ACATTCGGCC ATTTAIGCCT 3240 

TCACGTTAAT TTEIAITMG TTTTTCG3CA IGGACTIG^C A*GCGGA£7G TTTTCTA^ 3300 
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mmccz xifiAcnrpc catcccgccg attogcgag ocoGrarr cattish 3360 

PCPGjOCXZ AACCCGCW TA7G3GIACG AT1XGCCAT mm&A CXHXCGTA 3420 
GATTTlCGjT GITCOmA GC7GGGVG3 TG4TTTGCAG XTffC-X-AA 3480 

CC^GiTAT CTCTCCAC^G CATAACCTGG TCCCGaTI>V\ CCQGMTUTT CCTOCGCCT 3540 
T/GTCCCCGA GT^CWl^G AASMICG GCCCGGTCAA AAMTRJTG AACC^GTTO 3600 
ANOXIC AGTACT7GTG GTA7TOGG AAAAAATTG4 AGCTIICCGr AA^GAAIGG 3660 
AATQGATCGC CCCGAT7GGC ATAjCCGGTG O^TAAGAA CTACWCTG GCTTTCGGGT 3720 
TTCCGCCGdA G3CXGGTAC G^mSTUT TCA7WCAT TG3VOAAA TACAGWGC 3760 
AOXTTTCA GC^GTCCGAA G^CCATGCGG CG^CCTTAAA AAXCTTTCG CGTTCG3CCC 3840 
TGiATTGCTT TA^CCG^GGA GGCXCCTCG TGGIGAAG7C CTA7GX7AC GCCG^CCGCA 39G0 

AOGTl^GGA CGTAGTC^C GCTCTiGCCA GAA^GTTTGT OSGGTGTCT GCAGCSSGAC 3960 

OXBATTCTGT C7WGCAAT PCXmiGl -CCTCATTTT CO^CTA GACAAC^C 4020 

CTACACGGCA ATTCXCCCG CXCATCTI^ ATTCCGfiGAT TiZSTCCGTG TATG^GGTA 4080 

CW=G;TGG ACTTCMCC GCGCCGTC4T PCHZXSM A/SQGSGWT AT7GC7GAC7 4140 

GTCAiiGrGGA AGCAGTTUfiC AACGC^GCCA ATCCGOGGG TAG^C^GGC GVG^TCT 42C0 

GCCGTGCCAT CTATAAACGT iGGCCG^CCA GTTTTACCG4 GAGACAGGCA 4260 

CCGC^AGlAJ GACTUIGTGC CTA3G/WJ& A^GTGATCGA CGCG^GX COGATTTCC 4320 

QGVGCACCC PGMGVm GCCTOAAT 1GCTACAAAA CGCCTAXAT GC^GTGGC^ 4380 

ACiWAAA TGACATAAC ATCVGIUIG TCGCCATTCC A3TGCTATC7 AMGCATTT 4440 

ACIAGCCGG AWGiCCGC CTTGAAGTAT CXTTAACTG OTCWCC GCGCTAGACA 4500 

GWHXGC GGACGTAACC ATCTATTGCC IT^VTAAGAA GIGGACBAA AGAATCG^G 4560 

CGGCOTCA KJTMGX TCTGTAACAG AO^VGSi. TGAAGATATG GAGATCG^CG 4620 

ATG*G7TAG7 ATGGATCCAT CC*GAC*GiT GCT1WGGG AAGWG3GA TOCTOA 4680 
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GAWGGAAA ATTCTATTCG TATRLW OXOW\TT (XATWGCA (&m&CA 4740 

TGGCG^GAT AWjTCCTG TTCCOMTG XDG^mG miWCAA ClblGiUX T 4800 

/OTATTCGj TG=GWTG GWCMTCC GG&WGIb CCCQGIU^C OTAACCCGT 4860 

- CUIUTAXCC Q2LWACG 1TCCCGTGCC TT1GCATCTA TCCCATGaCG CC^GWGGS 4920 

TiX^CAG^CT TAGVGGW AAOTIAAAG ASGTTAG^GT ATGTKUCC ^GUCGTC 4980 

CTAAGCXAA MTTAtfiAAT GTTCAGAAL TTDWAC GWCTAGTC CTUnTMTC 5C40 

OGOOCTCC CGCATTCG77 CCCGCCCGTA ACTACWGA /£7GG>G\A (X3XTACCG 5100 

CTCCTCCTbC PGGZH^G GPGGCCCCCG A/GTIUTAGC &OICG7CA CCATCTAC-G 5160 

CTGATAAC-C (XGCHGAT GTCAC^CA TCTCACTG3A TA7GGATGAC ^GCBVG 5220 

gctcctttt ttc^gott agcggatcgg .cwtctat tactagtatg GC^ni^rr 52S0 

GJTUGGtfC TAGTTTCXTA GPGXTSGTAG ACCSV^A GG7GGJGGTG -GCTT^CGiTC 5540 

A7GCCG7CCA s&£CCTGCC CCTATTCCAC CGOMGCT AA/SGVGATG GCCdGtS 5400 

MWGCCC /CTXKCGG WGCAATO CTC7GAG7CC CTGXCTC7 5460 

(TmalTGG GGTATCCATC TCCC7CG2AT CAATTTTCGA CGG^GACG GCCCGCC^GG 5520 

C^GCGGTACA CCCTGGCA PG4GGCCCCA CGGATS7GCC TATbTCiTTC GSATCSmTT 5560 

CCG^GG^ GATTCATGXa (XflGCCGW C^GTAAO^ OTCGWCC GTCCTGTTTb 5640 

&TWTTil3A ACCG23CGAA GTGWCAA TTATATCGTC CCGATC&CC GTATUiTTTC 5700 

C^CTACGCM GC^CGT AG£CC£GA GO^GM 7GAATACTGA aA*CCG£3 5760 

TAjGTGjGTA CATATT7TCG PC^C^OG GCCCH333CA CmlOVWG ,WTCCGiTC 5320 

TCOG4ACCA GCTTAC^GAA CCGPCCniSG .^GGGC^A TUT COGGASAGA ATTCA7GCCC 5880 

CGGTOC7CGA CPCuTCGAAA GAGGWCAAC TCAAACTG^G GWMATC ATGCCMCG 5940 

MSXAfiOA M3TAGSTAC CA(TrCTCGl"A A/STAGAAAA TCiGWGCC ATAACCCTG 600G 
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Mx&CIXT QXXTATA ACTUGQX ASTCAGCCA G^ATQCTATA 6060 

AOTIXCTA TIIGWCA TTCTACTTCA CTAGCGTACC G30GACTAC TULATQX 6120 

AGTTCGOGr AGOGTCIUT AACACTA7C TGCATCWi CTATCCG^CA GTAGlATUTT 6180 

atdwtac lora^AC (^TirnAa ir^TATQir a#cg3gaca aramc 6240 

TGGATACTTGC ACCTTUGC CCGGCTAAGC TOGAAGTTA OISWWA CA7G=GTATA 6300 

&GCCCGGAA TATCCGMT GQGGTTCWT 0£CGATCCA GWXSTA QWW&TO: 6360 

TCATTuCCGC AACTAAAAGA MTTGCAACG TCXGC^T GSTTtVOB CCAA0O3S 6420 

ACRIAGCGAC ATiLAATUTC &V\HTTTTC GAAAATATGC ATGTMToAC G^AT7G2G 6480 

K&GnUZ TCG3WCCA ATTAGOTTA CC^GTT Ti^CGCA TATGTAGCTA 6540 

G^CGViAGG CCCIAAGGCC QCCGtXTAT TTCOAAGAC GIAIAATT7G G7CCCA71GC 6600 

AAGW^SCC TA7GGA7AGA TTlGTCATSG ACA i£VW£ AKGiWA GTTACXM 6660 

GCX^AACA G^^AG^A ^CGiAAG 7ACAAG7OT AC\^CGCA GWCCCTGG 6720 

CG^CTGCTTA CTTAiGGGSS AT7CACCGGG AA77AG7GCG TOGuTACG GCGJTCTTGC 6760 

TTTCArACAT TC^CGCTT TTr^CAkJ? CGGCG^GGA rTT7GAK£A ATCA7AGC*G 6840 

AAC^CTTCAA CWGGCGt CCGSTACTIBG AGACGGA7A7 CGCATCAT7C GACAWGCC 6900 

AA&iC^CGC TATCGGOTA ACQ^iTTI^ TCATGlGiA GS£CTG3G7 GTC^TG¥C 6960 

CAC7AC7CGA CTOTC&S TCCGCCT77G G fi GAAATA7C ATCCXCCAT CTACCTACG3 7020 

G7ACTCG77T TAAATT CGGG GCGATGA7UA AA7CCGSAA7 GTTujCACA Ul lii G i C A 7080 

ACAC^TuT GAA7U7CGTT A7CGCCAGCA &2GTACTAGA A&GCGGC77 AAAAGJ7CCA 7140 

GATUTGC^GC GTTCATiHS: G^GXAAGA TCATACA7EG fGifiGTATCT GXM&fiA 7200 

TGGCTG^G GTCCGuXC 7GGC7CAACA 7GGrG377AA- GA7CA7CGAC GCAG7CA7CG 7260 

m&GX£ tCCnXTIZ 7GC32CGGAT TTATCTTGCA ^\TTCGGTT fiOKDW 7320 

CuibCCGCGT GGCG2A7CCC GIEAWGGC TITuWrT GGG7AAACCG CTCCCAGj^ 7380 
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pamxzA tcxzu&c ccu&cnL corral tewc^ gojigito iw 

GSGTAGGTAT VOCZfiCT TTfiCOGRZ CCOT^C CCGGTATGAS GTA^CAATA 7500 
TTKACCM CCTACTGGCA T7GAGAACTT TTCCCCAG^G CA/W&CCA TTCCAAGCCA 7560 
TD1GSG3G2A AATAAAGCAT CTCWCQGIG OTTTAAATA (TOCCATAS TACATTTCAT 7620 
CIG^CTAATA CTAWCACC ACCaCCATGA ATAGASSATT CTTTAACATC CTCGGCCGCC 7680 
GCCCCTTCCC GGGCCCCACT GCCA7GTGS4 G3GCG032G iWSffiOG QGGGCCCCGA 7740 
TKCiECCCG QWESCT3 GC7TCXAAA TCCaGCWT C^CGCGCC GTOGTCCCC 7800 

TAG7CAT753 AC^GGCACT AGACCTCAAC CCDOCGICl ACGCCCGCd. COGDGCQXa 7860 

ASVGC^ GCCCaAGCAA CCACC&VGC GGVGAAACC AWACGC^G GAGSAGAAGA 7920 

AGVGCAACC TCCAWCCC AAACCCGGAA AG^GGG CATGS^ AAGTTCGAGG 7960 

CCGC4GA7T GTTCGACGTC AAGAACGAGG .CGS^HJT CATCSGXAC GCACTGGCCA 8040 

TGWGSW GGTAATGAAA CuOGGACG TGAAAGGAAC WTOBSCdC CCTGTGTAT 8100 

CAAAGCTCAA ATTTACCAAG TCGTCAGCAT ACGACA1GGA G7TUOW3 TTGCCAG7CA 8160 

ACATCHAGAAG TMKaTTC ACCTACTO q^^. TAWcns . ^ 

ACCACGGA3C GG7GCASTAT AGTGG^GGTA GATTTACCAT CCCTCGG3GA GTAGGAGGCA 8280 

CGGTCGTCCG ATCATCGATA ACTXEra QjTTCTCGCG ATASTCCTCG 8340 

(^GGCGCTGA TGAAGGACA CGACK3CCC TT1EGGT037 CACCTCSGAAT AGTAAAQGGA 8400 

AG^CAATTAA GACGACCCCG GAAGGG^C^G AAG^GIGSTC (GCAGCACCA CTIGGTCAGGG 8460 

CAATGTUriT GCTCGGAAAT GTGAGCTTCC CXSQJXZ CZZXXA TGCTATACCC 8520 

GCGAACC-C C^-CCCTC GACATCCTTG AAGWC3T GWCATGAG GCCTACGATA 8580 

COX-CTCAA 7BXATATTG CGGTCCGGAT CGICTSK AA3GVWGA A6CGTCAT1G 8640 

AGGAC7TTAC CCTGACCC-C CCCTXJTG1 QOMKTC GTTACTGCCAC CATACTGTAC 8700 
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CCrrc£TTD£ CCCIGTTAAG ATCGflQCAQG TUGGMCa PGJXK&tf fWXZXIPC 8760 

GCATAG^GC TTGCGCCCAG TTTGGATACG ACCWGGGj AG2GCVCC GCMCWF 8820 

accgctacat gthcttaag c^toca ccghta^ aggcaccatg gatcacatca 8880 

agattagcac cr^cqg tuta3wgc togctov\ ^t/ott ckoum 8940 

MTCCCCTCC ASS^O: GTAACGJITA GCATAGITO TAGWCTO SMS^A! 9000 

GTAGCTGjC CCGCAAGATA AAACCAAAAT TUjTCEGAOG GGAAAAATAT GA7UTACCTC 9060 

CCbTTGiCGG TAAAWATT CCTTGCAG>G TCTACQiCCG TUTGWG^A PCAfiCWG 9120 

TATbCX^GG CO^CS AGTT7 ATAC ATCCTACCTG GA^GV\TOT 9180 

CAGGSWGT TTACGCAW CCGCCATCTG GGA^GVCAT TACGTATG^G TCGW7GS 9240 

G^jAC^A GXTOIMI OTTGXCC GCXGGWT CflGGSHGC AGCGCCATCA 9300 

AGC^GTGZGT CKCTATAAG CGA^JT CTiDVOW CCGGACTfGA 9260 

TCAGAGA7GA CG^CCG GCCCWG^ MH^m GCCmOAG TTQA'CO^ 9420 

GTACCTGCAT GGTC0CTGT7 (xCCACGuGC CGAATbTAAT ACA.TGGCTTT AAACACATCA 9480 

GCCTCD3A TT AGAIAC^GAC 0*071^1 7G~CAGX O&^CTA G3GCOAACC 9640 

CGGYCG^G C^CTGMTGG ATCGTCGSV\ PGACSHO^G GiC&Om 9600 

ATBGCCISa ATACATA1G3 GGAAATCA7G XZJVGIVG GECTATGCC CA^G^GTOG 9660 

CACCAGGAGA CCCTTXG^ TGGCCACACG AMTAGTACA GCATTACTAC CATCGCCATC 9720 

CTGTGTADC CATCTTAGCC GTGGCATC^G CTACCGTGGC GATGMGATT GGCGTAACTG 9780 

TTCC ii GTG7T A7G7GCCTGT AAAGCGCGCC GI^GIGCCT G*CGCWTAC GCCOBGCCC 9840 

CWCGCCGT AATCCCAACT TGTiQGCX TCT7GTGCTG CGTTAGGTCG GCCAATGCTG 9900 

AAACGTTCX CG^CCATG tCTTKJm GCTOSAGG JOQmZ nCKa^CC 9960 

AGTTGTGCAT ACCTF1GGCC GCTTTCATCG TTCTAATGCG CTGCTGCTCC THTFGCCTGC 10020 
CTTT77TAGT GG77GCCGGC GCCTACCTGG C&AGsTXA CGXTACG^A CATGCG^CCA 10080 
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OCTTCCAM imZAOG ATACOSTATA XZZXUGT TGWQ3GO\ (ZSTATCCCC 10140 

anwTTi (zxatou ciiWTCnrr co&=G3iTn*Gcc^ 10200 

XAHPCHG CAAATTCACC dUIHUCC (UUZ£££A AATCWfGC TQjGGlIUJ 10260 
TH^ATCTO QCCQXCGCT CATH^CT ATACCTGCAA (£TUnO£A QGSJOflCC 10320 

ccrrrATUTG qgs^gcg gaatcttttt gcs^cwg; gwgx^g atc^gi^gg iogso 

(ZTXSKZA AmWZA GATTGGQGGT Ot^CCACGC GC^GGCGATT AAGSTGCACA 10440 
CTKCGCGAT GWGTAGGA GGCGTATR] TGTACGG^ (XTACC*GT TTCGAGATC 10500 

TUTAGTiGAA CGGrGTCACA CCAGGAACGu" CTAA£GAC77 QAAAGTCATA GOTGACCAA 10550 

TTOGCATC CTTTACGCCA T7CGATCATA ASIiSTT AT CCATCGCGGC CTGGnUTAGA 10620 

COESATAT MZflGZZ: GTOSSM AT7DVSJA 1C550 

CCTCCTTbAC TAGCAAGGAT C7CATCGCCA GMAOCAT AffiCCTTCCG 1074O 

CG^GVCGT GCATbTCCCG TACSCGMG CCTCATQiGG ATT1GAGATG TGGWAACA 108C0* 

TCCGAGCGGT GGACTG7TCA TACGSGACA ThZCCATTTC TATTl^GATC CCGAACGCTG 10920 

CCTTTATCnj GACATCAGAT GC^CACTGG TUTCA^AGT (AMTGiGAA GTCAG7GAGT 10960 

GCCTTATTU AGC^GAOTC GGCGGGAIGG CC^CCCTGCA (JTATGTATCC G/^CCGCGA^G U040 

GtTCAATCCCC CGTAGA1TCG CATTCGAGCA C^GCAACTCT CCA^GTCG AG^GTACATC 11100 

TCCTGG^GAA AGGAGCGGTC -C^CACT TO2CACCGC GCGAAC7T7A 11160 

TCGTATCGCT GTGiGGGAAG .WWCAT GCAATGC^GA ATCTAAACCA CCAGGGACC 11220 

ATATCGTGAG CACCCCGC^C AAWTG^CC A^ATTTCA PGDCGCCATC TCAAiAACAT 112S0 

CATCG^TIb GCTUiTTGCC C17TTCGGCG GZCCTGZC GCTATTMTT ATAGGACTTA 11340 

TGATFTTTCC TTGC^GCATC ATCCIGACTA GCACACGAAG ATCACCGC7A CGCCCCAATG 11400 
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ATO^CMAACra 1146Q 

GT^TOGA TCCCOGCTTA C0GCGQ3CAA TATA3CWA (TAWCTC GATUTOTt 11520 

CGf a GGA fl GCG G^GTGCATM 7I£TGCGC/iG IGTTCCC^ XWCCOVT ATTACCATT 11580 

TAIUTAGCQS £G3C£AA4M {XAA7GTAT TTUIGflGG^A QDGtTQJTHIA TM7I3XAOG 11640 

C^GCGTOjC ATA^CTTTTA TTATTTUTTT TATTMTCM 0WV\TTTTG TTTTTMIAT 11700 

TTOAWM AW AW M AWWVWA AVWVWW\ 11740 
(2) INFCRMAJICN FCR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: anrino acid 

(0 STRANCEENESS: single 
(D) TTPCL0GY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
Sen He Lej Gly Ser Arg 



C2J INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARCTERISTICS : 
CA) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TCPCLCGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1C5: 
GTCCGTTTGT CGTGOVO] C 



Zl 
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(2) INFORMATION FOR SEQ 10 NO: 106: 

(i) SECUENCE QWACMISnCS: 
(A) LENGTH: 21 base pairs 
CB) TYPE: nucleic acid 
(0 STTWEDNESS: single 
(D) TDFOLGGY: linear 



(xi) SEQUENCE DESCHIPTTCN: SEQ ID ,VJO:106: 
GTCC£uTrGT CG70CA/OG A 
C2) INFORMATION FOR SEQ ID N0:107: 

(i) SEQUENCE C4ARCTERISTTCS- 

CA) LENGTH: 21 base pairs 

CB) TYPE: nucleic acid 
(0 S7MCECNESS: single 
(D) TOPOLOGY ; linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CAATOTTOCT CKCCOTAG 0 
(2) INFORMATION FCR SEQ ID NO: 108: 

(i) SEQUENCE CBWjERISTICS: 

CA) LENGTH; 21 base pairs 

CB) TYPE: nucleic acid 
CO STRANCECNE5S: single 
CD) TOPOLOGY: linear 



Cxi) SECUENCE DESCRIPTION SEQ ID NO: KB; 
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CAATCTTCCT CXHCYTPG T 

(2) INFORMATION FOR 5EQ ID NO: 109: 

(i) SEQUENCE OWOERISTTC: 

(A) LENGTH: 21 base pairs 

(B) TVPE: nucleic acid 
(G S77WDECNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1G9: 

TOGCATrGT A 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARCTtRISuG: 
(A) LENGTH: 21 base pairs 
CB) TYPE: nucleic acid 
(0 S1WICEEMES5: single 
CD) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

lujmm tcagcawt t 

(2) INFORMATION FOR SEQ ID NO: 111: 

Ci) SEQUBJCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
CB) TYPE: nucleic acid 
CO STRANGENESS: single 
CD) TOPOLOGY: linear 
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Cxi) SBQUENCE DESCRIPTION: SEQ ID NO: 111: 
TATA7UCG4 GGCTtialGiT GTAGTATTAG TOG 
(2) INFORMATION FOR SEO ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
CD J TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 
TATATGATAT CAAAAAGCCT GAACTCACCG CGACG 
(2) INFORMATION FOR SEQ ID NO: 113; 

0) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 35 base pairs 
(8) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION': SEQ ID N0:113: 
ATATAGGATC CTCAGTTAGC CTCCCCCATC TCCCG 
C2) INFORMATION FOR SEQ ID NO: 114; 

(i) SEQUENCE CHARACTERISTICS- 

CA) LENGTH; 120 amino acids 

CB) TYPE: amino acid 
CO STRANDEDNESS: 
CD) TOPOLOGY: linear 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 114; 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
15 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Tnr Pro Met- Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met- 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Val 
50 55 60 

Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Gin Lys Pro Lys Glu 
°5 70 75 80 

Lys Lys Glu Asn Gin Lys Lys Lys Pro Thr Gin Lys Lvs Lys Gin Gin 
85 90 * 95 

Gin Lys Pro Lys Pro Gin Ala Lys Lys Lys Lys Pro Gly Arg Arg Glu 
100 105 110 

Arg Met Cys Met Lys He Glu Asn 
115 120 

(2} INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : 
CD)' TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 115; 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
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15 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro TTir Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu TTir Thr Lys Gin Asn Val 
50 55 60 

Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Gin Lys Pro Lys Glu 
65 70 75 80 ■ 

Lys Lys Glu Asn Gin Lys Lys Lys Pro Thr Leu Lys Arg Arg Glu Arg 
85 go 95 

Met Cys Met Lys He Glu Asn 
100 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 88 amino acids 

CB) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Met Asn Tyr lie Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
1 5 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 

35 40 45 

Gin Gin Leu lie Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Val 
50 55 60 
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Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Leu Lys Arg Arg Glu 
65 70 75 80 

Arg Met Cys Met Lys He Glu Asn 
85 

(2) INFORMATION FOR SEQ ID NO: 117: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
(8) TYPE; amino acid 
CO STRANDEDNESS : 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
15 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Leu 
50 55 60 

Lys Arg Arg Glu Arg Met Cys Met Lys He Glu Asn 
65 70 75 

(2) INFORMATION FOR SEQ IO NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
CB) TYPE: nucleic acid 
CO STRAN0EDNESS: single 
CD) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEG 10 NO: 118: 
ATATAGGATC CTTCGCATGA TTGAACAA6A TGGATTGC 



38 
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